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® Humanised antibodies. 

@ CDR-grafted antibody heavy and light chains comprise acceptor framework and donor antigen binding 
regions, the heavy chains comprising donor residues at at least one of positions (6, 23) and/or (24, 48) and/or 
(49, 71) and/or (73, 75) and/or (76) and/or (78) and/or (91). The CDR-grated light chains comprise donor residues 
at at least one of positions (1) and/or (3) and (46) and/or (47) or at at least one of positions (46, 48. 58) and (71). 
The CDR-grafted antibodies are preferably humanised antibodies, having non-human, e.g. rodent, donor residues 
and human acceptor frameworks, and may be used for in vivo therapy and diagnosis. A generally applicable 
protocol is disclosed for obtaining CDR-grafted antibodies. 
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1 CAATTCCCAA AGACAAA atq oatLt:t:te«AO toeaQatititit eaaefctieeta 

51 ctaateaata cctcaotcat: aafcatceaaa QQacaaatto ttctcaccca 

101 9tctccagca atcatgtctg ca^ctceagg ggagaaggic acca^gacct 

151 gcag^gccag c^caagtgta agttacatga actggt.acca gcagaagtca 

201 ggcacctccc ccaaaagatg gatttatgac acatccaaac tggct^ct.gg 

■251 agtccctgct cacttcaggg gcagtgggtc tgggacctct tactc^c^ca 

301 caatcagcgg catggaggct gaagatgctg ecadtatta c^gccagcag 

3 51 ^ggagtagta acccattcae g^tcggctcg gggacaaag^ tggaaataaa 

4 01 ccgggctgat ac^caccaa ctgtatccat cttcccacca tccagtgagc 
451 ag^taacatc tggaggtgcc tcag^cgtgt gct^ct^gaa caac^tctac 
501 cccaaagaca t:caa'tgt;caa gtggaagatt gatggcagtg aacgacaaaa 
551 tggcgtcctg aacagttgga ctga^cagga cagcaaagac agcacctaca 
601 gcatgagcag cacccteacg ^tgaccaagg acgagtatga acgacataac 
651 agctatacct. g^gaggccac tcacaagaca tcaacttcac ccattg^caa 
701 gagct^caac aggaatgagt gtTAGAGACA AAGGTCCTGA GACGCCACCA 
751 CCAGCTCCCA CCTCCATCCT . ATCTTCCCTT CTAAGGTCTT GGAGGCTTCC 
801 CCACAAGCGC tTACCACTCT TGCGGTCCTC tAAACCTCCT CCCACCTCCT 
851 TCTCCTCCTC CTCCCTTTCC TTGCCTTTTA TCATGCTAAT ATTTGCACAA 
901 AATATTCAAT AAACTGAGTC TTTGCCTTGA AAAAAAAAAA AAA 

Fig. Ua) 



1 MpypvoTrsr I.LISASVTTS RG OTVLTOSP AIHSA5P6EK VTHTCSASSS 

51 VSYKNWYQQK SGTSPKRWZY DTSKUVSGVP AHFRGSCSCT SYSLTI5CME 

101 AEDAATYYCQ QWSSNPrTFG SGTKLEIKKA DTAPTV5IFP PSSEQLTSGG 

151 A5WCFLNNF YPKDIHVKVK IDGSEPQNGV LN5KTOQDSK DSTYSMSSTL 

201 TLTKDEYERH NSYTCEATHK TSTSPIVKSF NPNEC* 

Fig. Kb) 
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Field of the Invention 

The present invention relates to humanised antibody molecules, to processes for their production using 
recombinant DNA technology, and to their therapeutic uses. 
5 The term "humanised antibody molecule'* in used to describe a molecule having an antigen binding site 
derived from an immunoglobulin from a non-human species, and remaining immunoglobulin-derlved parts of 
the molecule being derived from a human immunoglobulin. The antigen binding site typically comprises 
complementarity determining regions (CDRs) which determine the binding specificity of the antibody 
molecule and which are carried on appropriate framework regions in the variable domains. There are 3 
10 CDRs (CDRl , CDR2 and CDR3) in each of the heavy and light chain variable domains. 

In the description, reference is made to a number of publications by number. The publications are listed 
in numerical order at the end of the description. 

Background of the Invention 

75 

Natural immunoglobulins have been known for many years, as have the various fragments thereof, such 
as the Fab, (Fab')2 and Fc fragments, which can be derived by enzymatic cleavage. Natural im- 
munoglobulins comprise a generally Y-shaped molecule having an antigen-binding site towards the end of 
each upper arm. The remainder of the structure, and partlculariy the stem of the Y, mediates the effector 

20 functions associated with immunoglobulins. 

Natural immunoglobulins have been used in assay, diagnosis and, to a more limited extent, therapy. 
However, such uses, especially in therapy, were hindered until recently by the polyclonal nature of natural 
immunoglobulins. A significant step towards the realisation of the potential of immunoglobulins as therapeu- 
tic agents was the discovery of procedures for the production of monoclonal antibodies (MAbs) of defined 

25 specificity (1). 

However, most MAbs are produced by hybridomas which are fusions of rodent spleen cells with rodent 
myeloma cells. They are therefore essentially rodent proteins. There are very few reports of the production 

of human MAbs. 

Since most available MAbs are of rodent origin, they are naturally antigenic in humans and thus can 
30 give rise to an undesirable immune response termed the HAMA (Human Anti-Mouse Antibody) response. 
Therefore, the use of rodent MAbs as therapeutic agents in humans is inherently limited by the fact that the 
human subject will mount an immunological response to the MAb and will either remove it entirely or at 
least reduce Its effectiveness. In practice, MAbs of rodent origin may not be used in patients for more than 
one or a few treatments as a HAMA response soon develops rendering the MAb ineffective as well as 
35 giving rise to undesirable reactions. For instance, 0KT3 a mouse lgG2a/k MAb which recognises an antigen 
in the T-cell receptor-CD3 complex has been approved for use in many countries throughout the worid as 
an immunosuppressant in the treatment of acute allograft rejection [Chatenoud et al (2) and Jeffers et al (3)- 
]. However, in view of the rodent nature of this and other such MAbs, a significant HAMA response which 
may include a major anti-idiotype component, may build up on use. Clearly, it would be highly desirable to 
40 diminish or abolish this undesirable HAMA response and thus enlarge the areas of use of these very useful 
antibodies. 

Proposals have therefore been made to render non-human MAbs less antigenic in humans. Such 
techniques can be generically termed "humanisation" techniques. These techniques typically involve the 
use of recombinant DNA technology to manipulate DNA sequences encoding the polypeptide chains of the 
46 antibody molecule. 

Eariy methods for humanising MAbs involved production of chimeric antibodies in which an antigen 
binding site comprising the complete variable domains of one antibody is linked to constant domains 
derived from another antibody. Methods for carrying out such chimerisation procedures are described in 
EP01 20694 (Celltech Limited). EP01 25023 (Genentech Inc. and City of Hope), EP-A-0 171496 (Res. Dev. 

50 Corp. Japan), EP-A-0 173 494 (Stanford University), and WO 86/01533 (Celltech Limited). This latter 
Celltech application (WO 86/01533) discloses a process for preparing an antibody molecule having the 
variable domains from a mouse MAb and the constant domains from a human immunoglobulin. Such 
humanized chimeric antibodies, however, still contain a significant proportion of non-human amino acid 
sequence, i.e. the complete non-human variable domains, and thus may still elicit some HAMA response, 

55 partlculariy if administered over a prolonged period (Begent et al (ref. 4)]. 

In an alternative approach, described in EP-A-0239400 (Winter), the complementarity determining 
regions (CDRs) of a mouse MAb have been grafted onto the framework regions of the variable domains of a 
human immunoglobulin by site directed mutagenesis using long oligonucleotides. The present invention 
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relates to humanized antibody molecules prepared according to this alternative approach, i.e. CDR-grafted 
humanised antibody molecules. Such CDR-grafted humanized antibodies are much less likely to give rise to 
a HAMA response than humanised chimeric antibodies in view of the much lower proportion of non-human 
amino acid sequence which they contain. 

5 The earliest work on humanizing MAbs by CDR-grafting was carried out on MAbs recognizing synthetic 
antigens, such as the NP or NIP antigens. However, examples in which a mouse MAb recognizing lysozyme 
and a rat MAb recognising an antigen on human T-cells were humanised by CDR-grafting have been 
described by Verhoeyen et al (5) and Riechmann et al (6) respectively. The preparation of CDR-grafted 
antibody to the antigen on human T cells is also described in WO 89/07452 (Medical Research Council). 

10 In Riechmann et al /Medical Research Council it was found that transfer of the CDR regions alone [as 
defined by Kabat refs. (7) and (8)] was not sufficient to provide satisfactory antigen binding activity in the 
CDR-grafted product. Riechmann et ai found that it was necessary to convert a serine residue at position 27 
of the human sequence to the corresponding rat phenylalanine residue to obtain a CDR-grafted product 
having improved antigen binding activity. This residue at position 27 of the heavy chain is within the 

76 Structural loop adjacent to CDR1. A further construct which additionally contained a human serine to rat 
tyrosine change at position 30 of the heavy chain did not have a significantly altered binding activity over 
the humanised antibody with the serine to phenylalanine change at position 27 alone. These results indicate 
that changes to residues of the human sequence outside the CDR regions, in particular in the structural 
loop adjacent to CDR1, may be necessary to obtain effective antigen binding activity for CDR-grafted 

20 antibodies which recognise more complex antigens. Even so the binding affinity of the best CDR-grafted 
antibodies obtained was still significantly less than the original MAb. 

Very recently Queen et al (9) have described the preparation of a humanised antibody that binds to the 
interleukin 2 receptor, by combining the CDRs of a murine MAb (anti-Tac) with human immunoglobulin 
framework and constant regions. The human framework regions were chosen to maximise homology with 

25 the anti-Tac MAb sequence. In addition computer modelling was used to identify framework amino acid 
residues which wore likely to interact with the CDRs or antigen, and mouse amino acids were used at these 
positions in the humanised antibody. 

In WO 90/07861 Queen et al propose four criteria for designing humanised immunoglobulins. The first 
criterion is to use as the human acceptor the framework from a particular human immunoglobulin that is 

30 unusually homologous to the non-human donor immunoglobulin to be humanised, or to use a consensus 
framework from many human antibodies. The second criterion is to use the donor amino acid rather than 
the acceptor if the human acceptor residue is unusual and the donor residue is typical for human 
sequences at a specific residue of the framework. The third criterion is to use the donor framework amino 
acid residue rather than the acceptor at positions immediately adjacent to the CDRs. The fourth criterion is 

35 to use the donor amino acid residue at framework positions at which the amino acid is predicted to have a 
side chain atom within about 3 A of the CDRs in a three-dimensional immunoglobulin model and to be 
capable of interacting with the antigen or with the CDRs of the humanised immunoglobulin. It is proposed 
that criteria two, three or four may be applied in addition or alternatively to criterion one, and may be 
applied singly or in any combination. 

40 WO 90/07861 describes in detail the preparation of a single CDR-grafted humanised antibody, a 
humanised antibody having specificity for the p55 Tac protein of the IL-2 receptor. The combination of all 
four criteria, as above, were employed in designing this humanized antibody, the variable region frame- 
works of the human antibody Eu (7) being used as acceptor. In the resultant humanised antibody the donor 
CDRs were as defined by Kabat et al (7 and 8) and in addition the mouse donor residues were used in 

45 place of the human acceptor residues, at positions 27, 30. 48, 66. 67, 89, 91. 94, 103. 104, 105 and 107 in 
the heavy chain and at positions 48, 60 and 63 in the light chain, of the variable region frameworks. The 
humanised anti-Tac antibody obtained is reported to have an affinity for p55 of 3 x 10^ M~\ about one-third 
of that of the murine MAb. 

We have further investigated the preparation of CDR-grafted humanised antibody molecules and have 
50 identified a hierarchy of positions within the framework of the variable regions (i.e. outside both the Kabat 
CDRs and structural loops of the variable regions) at which the amino acid identities of the residues are 
important for obtaining CDR-grafted products with satisfactory binding affinity. This has enabled us to 
establish a protocol for obtaining satisfactory CDR-grafted products which may be applied very widely 
irrespective of the level of homology between the donor immunoglobulin and acceptor framework. The set 
55 of residues which we have identified as being of critical importance does not coincide with the residues 
identified by Queen et al (9). 
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Summary of the Invention 

Accordingly, in a first aspect the invention provides a CDR-grafted antibody heavy chain having a 
variable region domain comprising acceptor framework and donor antigen binding regions wherein the 
5 framework comprises donor residues at at least one of positions 6. 23 and/or 24, 48 and/or 49, 71 and/or 
73. 75 and/or 76 and/or 78 and 88 and/or 91. 

In preferred embodiments, the heavy chain framework comprises donor residues at positions 23. 24, 49. 
71. 73 and 78 or at positions 23. 24 and 49. The residues at positions 71, 73 and 78 of the heavy chain 
framework are preferably either all acceptor or all donor residues. 
10 In particularly preferred embodiments the heavy chain framework additionally comprises donor residues 
at one, some or all of positions 6. 37. 48 and 94. Also it is particularly preferred that residues at positions of 
the heavy chain framework which are commonly conserved across species, i.e. positions 2, 4, 25, 36, 39, 
47, 93, 103, 104, 106 and 107, If not conserved between donor and acceptor, additionally comprise donor 
residues. Most preferably the heavy chain framework additionally comprises donor residues at positions 2; 
15 4. 6, 25, 36. 37, 39, 47, 48. 93. 94, 103. 104, 106 and 107. 

In addition the heavy chain framework optionally comprises donor residues at one, some or all of 
positions: 
1 and 3, 
72 and 76. 

20 69 (If 48 Is different between donor and acceptor), 

38 and 46 (if 48 is the donor residue). 
80 and 20 (if 69 is the donor residue), 
67. 

82 and 1 8 (if 67 is the donor residue), 
25 91, 
88. and 

any one or more of 9. 11, 41, 87, 108. 110 and 112. 

In the first and other aspects of the present Invention reference is made to CDR-grafted antibody 
products comprising acceptor framework and donor antigen binding regions. It will be appreciated that the 
30 invention is widely applicable to the CDR-grafting of antibodies in general. Thus, the donor and acceptor 
antibodies may be derived from animals of the same species and even same antibody class or sub-class. 
More usually, however, the donor and acceptor antibodies are derived from animals of different species. 
Typically the donor antibody Is a non-human antibody, such as a rodent MAb, and the acceptor antibody is 
a human antibody. 

35 In the first and other aspects of the present invention, the donor antigen binding region typically 
comprises at least one CDR from the donor antibody. Usually the donor antigen binding region comprises 
at least two and preferably all three CDRs of each of the heavy chain and/or light chain variable regions. 
The CDRs may comprise the Kabat CDRs, the structural loop CDRs or a composite of the Kabat and 
structural loop CDRs and any combination of any of these. Preferably, the antigen binding regions of the 

40 CDR-grafted heavy chain variable domain comprise CDRs corresponding to the Kabat CDRs at CDR2 
(residues 50-65) and CDR3 (residues 95-100) and a composite of the Kabat and structural loop CDRs at 
CDR1 (residues 26-35). 

The residue designations given above and elsewhere in the present application are numbered accord- 
ing to the Kabat numbering [refs, (7) and (8)]. Thus the residue designations do not always correspond 

45 directly with the linear numbering of the amino acid residues. The actual linear amino acid sequence may 
contain fewer or additional amino acids than In the strict Kabat numbering corresponding to a shortening of, 
or insertion into, a structural component, whether framework or CDR, of the basic variable domain structure. 
For example, the heavy chain variable region of the anti-Tac antibody described by Queen et al (9) contains 
a single amino acid insert (residue 52a) after residue 52 of CDR2 and a three amino acid insert (residues 

50 82a, 82b and 82c) after framework residue 82, in the Kabat numbering. The correct Kabat numbering of 
residues may be determined for a given antibody by alignment at regions of homology of the sequence of 
the antibody with a "standard" Kabat numbered sequence. 

The invention also provides in a second aspect a CDR-grafted antibody light chain having a variable 
region domain comprising acceptor framework and donor antigen binding regions wherein the framework 

55 comprises donor residues at at least one of positions 1 and/or 3 and 46 and/or 47. Preferably the CDR 
grafted light chain of the second aspect comprises donor residues at positions 46 and/or 47. 

The invention also provides in a third aspect a CDR-grafted antibody light chain having a variable region 
domain comprising acceptor framework and donor antigen binding regions wherein the framework com- 
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prises donor residues at at least one of positions 46. 48, 58 and 71. 

In a preferred ennbodiment of the third aspect, the frameworl< comprises donor residues at all of 
positions 46, 48. 58 and 71 . 

In particularly preferred enn bod indents of the second and third aspects, the framework additionally . 
5 comprises donor residues at positions 36. 44, 47, 85 and 87. Similarly positions of the light chain framework 
which are commonly conserved across species, i.e. positions 2, 4, 6, 35, 49, 62. 64-69, 98. 99, 101 and 
102. if not conserved between donor and acceptor, additionally comprise donor residues. Most preferably 
the light chain framework additionally comprises donor residues at positions 2, 4, 6. 35. 36, 38, 44, 47, 49. 
62, 64-69, 85. 87. 98. 99. 101 and 102. 
10 In addition the framework of the second or third aspects optionally comprises donor residues at one, 
some or all of positions: 
1 and 3, 
63. 

60 (if 60 and 54 are able to form at potential saltbridge), 
75 70 (if 70 and 24 are able to form a potential saltbridge). 
73 and 21 (if 47 is different between donor and acceptor), 
37 and 45 (if 47 is different between donor and acceptor), 
and 

any one or more of 10. 12, 40. 80, 103 and 105. 
20 Preferably, the antigen binding regions of the CDR-grafted light chain variable domain comprise CDRs 
corresponding to the Kabat CDRs at CDR1 (residue 24-34). CDR2 (residues 50-56) and CDR3 (residues 89- 
97), 

The invention further provides in a fourth aspect a CDR-grafted antibody molecule comprising at least 
one CDR-grafted heavy chain and at least one CDR-grafted light chain according to the first and second or 

25 first and third aspects of the invention. 

The humanised antibody molecules and chains of the present invention may comprise: a complete 
antibody molecule, having full length heavy and light chains; a fragment thereof, such as a Fab, (Fab*)2 or 
FV fragment; a light chain or heavy chain monomer or dimer; or a single chain antibody, e.g. a single chain 
FV in which heavy and light chain variable regions are joined by a peptide linker; or any other CDR-grafted 

30 molecule with the same specificity as the original donor antibody. Similarly the CDR-grafted heavy and light 
chain variable region may be combined with other antibody domains as appropriate. 

Also the heavy or light chains or humanised antibody molecules of the present invention may have 
attached to them an effector or reporter molecule. For instance, it may have a macrocycle, for chelating a 
heavy metal atom, or a toxin, such as ricin. attached to it by a covalent bridging structure. Alternatively, the 

36 procedures of recombinant DNA technology may be used to produce an immunoglobulin molecule in which 
the Fc fragment or CH3 domain of a complete immunoglobulin molecule has been replaced by. or has 
attached thereto by peptide linkage, a functional non-immunoglobulin protein, such as an enzyme or toxin 
molecule. 

Any appropriate acceptor variable region framework sequences may be used having regard to 

40 class/type of the donor antibody from which the antigen binding regions are derived. Preferably, the type of 
acceptor framework used is of the same/similar class/type as the donor antibody. Conveniently, the 
framework may be chosen to maximise/optimise homology with the donor antibody sequence particularly at 
positions close or adjacent to the CDRs. However, a high level of homology between donor and acceptor 
sequences is not important for application of the present invention. The present invention identifies a 

46 hierarchy of framework residue positions at which donor residues may be important or desirable for 
obtaining a CDR-grafted antibody product having satisfactory binding properties. The CDR-grafted products 
usually have binding affinities of at least 10^ M-\ preferably at least about 10^ or especially in the 
range 10^-10^^ in principle, the present invention is applicable to any combination of donor and 

acceptor antibodies irrespective of the level of homology between their sequences. A protocol for applying 

60 the invention to any particular donor-acceptor antibody pair is given hereinafter. Examples of human 
frameworks which may be used are KOL, NEWM. REI. EU. LAY and POM (refs. 4 and 5) and the like; for 
instance KOL and NEWM for the heavy chain and REI for the light chain and EU. LAY and POM for both 
the heavy chain and the light chain. 

Also the constant region domains of the products of the invention may be selected having regard to the 

56 proposed function of the antibody in particular the effector functions which may be required. For example, 
the constant region domains may be human IgA, IgE, IgG or IgM domains. In particular, IgG human 
constant region domains may be used, especially of the IgGI and lgG3 isotypes, when the humanised 
antibody molecule is intended for therapeutic uses, and antibody effector functions are required. Alter- 
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natively, lgG2 and lgG4 isotypes may be used when the humanised antibody molecule Is Intended for 
therapeutic purposes and antibody effector functions are not required, e.g. for simple blocking of lym- 
phokine activity. 

However, the remainder of the antibody molecules need not comprise only protein sequences from 
5 Immunoglobulins. For instance, a gene may be constructed In which a DNA sequence encoding part of a 
human immunoglobulin chain Is fused to a DNA sequence encoding the amino acid sequence of a 
functional polypeptide such as an effector or reporter molecule. 

Preferably the CDR-grafted antibody heavy and light chain and antibody molecule products are 
produced by recombinant DNA technology. 
10 Thus in further aspects the invention also includes DNA sequences coding for the CDR-grafted heavy 
and light chains, cloning and expression vectors containing the DNA sequences, host cells transformed with 
the DNA sequences and processes for producing the CDR-grafted chains and antibody molecules 
comprising expressing the DNA sequences in the transformed host cells. 

The general methods by which the vectors may be constructed, transfection methods and culture 
15 methods are well known per se and form no part of the invention. Such methods are shown, for Instance, in 
references 1 0 and 1 1 . 

The DNA sequences which encode the donor amino acid sequence may be obtained by methods well 
known in the art. For example the donor coding sequences may be obtained by genomic cloning, or cDNA 
cloning from suitable hybridoma cell lines. Positive clones may be screened using appropriate probes for 

20 the heavy and light chain genes In question. Also PCR cloning may be used. 

DNA coding for acceptor, e.g. human acceptor, sequences may be obtained in any appropriate way. 
For example DNA sequences coding for preferred human acceptor frameworks such as KOL, REI, EU and 
NEWM, are widely available to workers in the art. 

The standard techniques of molecular biology may be used to prepare DNA sequences coding for the 

25 CDR-grafted products. Desired DNA sequences may be synthesised completely or In part using 
oligonucleotide synthesis techniques. Site-directed mutagenesis and polymerase chain reaction (PCR) 
techniques may be used as appropriate. For example oligonucleotide directed synthesis as described by 
Jones et al (ref. 20) may be used. Also oligonucleotide directed mutagenesis of a pre-exising variable 
region as. for example, described by Verhoeyen et al (ref. 5) or Riechmann et al (ref. 6) may be used. Also 

30 enzymatic filling in of gapped oligonucleotides using 'T^ DNA polymerase as, for example, described by 
Queen et al (ref. 9) may be used. 

Any suitable host cell/vector system may be used for expression of the DNA sequences coding for the 
CDR-grafted heavy and light chains. Bacterial e.g. E. coli , and other microbial systems may be used, In 
particular for expression of antibody fragments such as FAb and (Fab')2 fragments, and especially FV 

35 fragments and single chain antibody fragments e.g. single chain FVs. Eucaryotic e.g. mammalian host cell 
expression systems may be used for production of larger CDR-grafted antibody products, including 
complete antibody molecules. Suitable mammalian host cells include CHO cells and myeloma or hybridoma 
cell lines. 

Thus. In a further aspect the present invention provides a process for producing a CDR-grafted antibody 
40 product comprising: 

(a) producing In an expression vector an operon having a DNA sequence which encodes an antibody 
heavy chain according to the first aspect of the invention; 

and/or 

(b) producing in an expression vector an operon having a DNA sequence which encodes a complemen- 
45 tary antibody light chain according to the second or third aspect of the invention; 

(c) transfecting a host cell with the or each vector; and 

(d) culturing the transfected cell line to produce the CDR-grafted antibody product. 

The CDR-grafted product may comprise only heavy or light chain derived polypeptide, In which case 
only a heavy chain or light chain polypeptide coding sequence is used to transfect the host cells. 

60 For production of products comprising both heavy and light chains, the cell line may be transfected with two 
vectors, the first vector may contain an operon encoding a light chain-derived polypeptide and the second 
vector containing an operon encoding a heavy chain-derived polypeptide. Preferably, the vectors are 
Identical, except in so far as the coding sequences and selectable markers are concerned, so as to ensure 
as far as possible that each polypeptide chain is equally expressed. Alternatively, a single vector may be 

55 used, the vector Including the sequences encoding both light chain- and heavy chain-derived polypeptides. 

The DNA In the coding sequences for the light and heavy chains may comprise cDNA or genomic DNA 
or both.^ However, it is preferred that the DNA sequence encoding the heavy or light chain comprises at 
least partially, genomic DNA, preferably a fusion of cDNA and genomic DNA. 

7 



EP000620276 rfile:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\EP000620276.CPCl 



Pa g e 8 of 66 



EP 0 620 276 A1 

Th© present invention is applicable to antibodies of any appropriate specificity. Advantageously, 
however, the invention may be applied to the humanisation of non-human antibodies which are used for in 
vivo therapy or diagnosis. Thus the antibodies may be site-specific antibodies such as tumour-specific or 
cell surface-specific antibodies, suitable for use in in vivo therapy or diagnosis, e.g. tumour imaging. 

5 Examples of cell surface-specific antibodies are anti-T cell antibodies, such as anti-CD3, and CD4 and 
adhesion molecules, such as CR3, ICAM and ELAM. The antibodies may have specificity for interleukins 
(including lymphokines, growth factors and stimulating factors), hormones and other biologically active 
compounds, and receptors for any of these. For example, the antibodies may have specificity for any of the 
following: Interferons a, )S, y or 8, IL1. IL2, IL3. or IL4. etc.. TNF. GCSF. GMCSF. EPO, hGH, or insulin, etc. 

10 The the present invention also includes therapeutic and diagnostic compositions comprising the CDR- 
grafted products of the invention and uses of such compositions in therapy and diagnosis. 

Accordingly in a further aspect the invention provides a therapeutic or diagnostic composition compris- 
ing a CDR-grafted antibody heavy or light chain or molecule according to previous aspects of the invention 
in combination with a pharmaceutically acceptable carrier, diluent or excipient. 

75 Accordingly also the Invention provides a method of therapy or diagnosis comprising administering an 
effective amount of a CDR-grafted antibody heavy or light chain or molecule according to previous aspects 
of the invention to a human or animal subject. 

A preferred protocol for obtaining CDR-grafted antibody heavy and light chains in accordance with the 
present invention is set out below together with the rationale by which we have derived this protocol. This 

20 protocol and rationale are given without prejudice to the generality of the invention as hereinbefore 
described and defined. 

Protocol 

25 It is first of all necessary to sequence the DNA coding for the heavy and light chain variable regions of 
the donor antibody, to determine their amino acid sequences. It is also necessary to choose appropriate 
acceptor heavy and light chain variable regions, of known amino acid sequence. The CDR-grafted chain is 
then designed starting from the basis of the acceptor sequence. It will be appreciated that In some cases 
the donor and acceptor amino acid residues may be identical at a particular position and thus no change of 

30 acceptor framework residue is required. 

1 . As a first step donor residues are substituted for acceptor residues in the CDRs. For this purpose the 
CDRs are preferably defined as follows: 

Heavy chain - CDR1 : residues 26-35 

- CDR2: residues 50-65 
35 - CDRS: residues 95-102 

Light chain - CDR1 : residues 24-34 

- CDR2: residues 50-56 

- CDR3: residues 89-97 

The positions at which donor residues are to be substituted for acceptor in the framework are then 
40 chosen as follows, first of all with respect to the heavy chain and subsequently with respect to the light 
chain. 

2. Heavy Chain 

2.1 Choose donor residues at all of positions 23, 24, 49, 71, 73 and 78 of the heavy chain or all of 
positions 23, 24 and 49 (71 , 73 and 78 are always either all donor or all acceptor). 
45 2.2 Check that the following have the same amino acid in donor and acceptor sequences, and if not 

preferably choose the donor: 2, 4, 6. 25. 36. 37. 39. 47. 48, 93, 94. 103, 104. 106 and 107. 
2.3 To further optimise affinity consider choosing donor residues at one, some or any of: 

i. 1, 3 

ii. 72, 76 

50 iii. If 48 is different between donor and acceptor sequences, consider 69 

iv. If at 48 the donor residue is chosen, consider 38 and 46 
V. If at 69 the donor residue is chosen, consider 80 and then 20 

vi. 67 

vii. If at 67 the donor residue is chosen, consider 82 and then 18 
55 viii. 91 

ix. 88 

X. 9, 11, 41, 87, 108, 110. 112 

3. Light Chain 
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3.1 Choose donor at 46. 48. 58 and 71 

3.2 Check that the following have the same amino acid in donor and acceptor sequences. If not 
preferably choose donor: 

2, 4. 6. 35. 38. 44, 47, 49. 62, 64-69 Inclusive. 85, 87. 98. 99. 101 and 102 
5 3.3 To further optimise affinity consider choosing donor residues at one. some or any of: 

i. 1. 3 

ii. 63 

Hi. 60, if 60 and 54 are able to form potential saltbridge 
iv. 70, if 70 and 24 are able to form potential saltbridge 
10 V. 73, and 21 if 47 is different between donor and acceptor 

vi. 37, and 45 if 47 is different between donor and acceptor 

vii. 10. 12, 40. 80. 103, 105 

Rationale 

75 

In order to transfer the binding site of an antibody into a different acceptor framework, a number of 
factors need to be considered. 

1. The extent of the CDRs 

The CDRs (Complementary Determining Regions) were defined by Wu and Kabat (refs. 4 and 5) on the 
20 basis of an analysis of the variability of different regions of antibody variable regions. Three regions per 
domain were recognised. In the light chain the sequences are 24-34, 50-56. 89-97 (numbering according 
to Kabat (ref. 4). Eu Index) inclusive and in the heavy chain the sequences are 31-35. 50-65 and 95-102 
inclusive. 

When antibody structures became available it became apparent that these CDR regions cor- 
25 responded in the main to loop regions which extended from the barrel framework of the light and 
heavy variable domains. For HI there was a discrepancy In that the loop was from 26 to 32 inclusive and 
for H2 the loop was 52 to 56 and for L2 from 50 to 53. However, with the exception of HI the CDR 
regions encompassed the loop regions and extended into the /S strand frameworks. In HI residue 26 
tends to be a serine and 27 a phenylalanine or tyrosine, residue 29 is a phenylalanine in most cases. 
30 Residues 28 and 30 which are surface residues exposed to solvent might be involved in antigen-binding. 
A prudent definition of the H1 CDR therefore would include residues 26-35 to include both the loop 
region and the hypervariable residues 33-35. 

It Is of interest to note the example of RIechmann et al (ref. 3), who used the residue 31-35 choice 
for CDR-H1 . In order to produce efficient antigen binding, residue 27 also needed to be recruited from 
36 the donor (rat) antibody. 

2. Non-CDR residues which contribute to antigen binding 

By examination of available X-ray structures we have identified a number of residues which may have an 
effect on net antigen binding and which can be demonstrated by experiment. These residues can be 
sub-divided Into a number of groups. 
40 2.1 Surface residues near CDR [all numbering as in Kabat et al (ref. 7)]. 

2.1.1. Heavy Chain - Key residues are 23, 71 and 73. Other residues which may contribute to a 
lesser extent are 1.3 and 76. Finally 25 is usually conserved but the murine residue should be 
used if there is a difference. 

2.1.2 Light Chain - Many residues close to the CDRs, e.g. 63, 65, 67 and 69 are conserved. If 
45 conserved none of the surface residues in the light chain are likely to have a major effect. 

However, if the murine residue at these positions is unusual, then it would be of benefit to analyse 
the likely contribution more closely. Other residues which may also contribute to binding are 1 and 
3, and also 60 and 70 if the residues at these positions and at 54 and 24 respectively are 
potentially able to form a salt bridge i.e. 60 + 54; 70 + 24. 
50 2.2 Packing residues near the CDRs. 

2.2.1. Heavy Chain - Key residues are 24, 49 and 78. Other key residues would be 36 if not a 
tryptophan, 94 if not an arginine. 104 and 106 if not glycines and 107 if not a threonine. Residues 
which may make a further contribution to stable packing of the heavy chain and hence improved 
affinity are 2, 4, 6, 38. 46, 67 and 69. 67 packs against the CDR residue 63 and this pair could be 
55 either both mouse or both human. Finally, residues which contribute to packing In this region but 

from a longer range are 18. 20, 80, 82 and 86. 82 packs against 67 and in turn 18 packs against 
82. 80 packs against 69 and in turn 20 packs against 80. 86 forms an H bond network with 38 and 
46. Many of the mouse-human differences appear minor e.g. Leu- He, but could have an minor 

9 
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impact on correct packing whicli could translate into altered positioning of the CDRs. 
2.2.2. Light Chain - Key residues are 48, 58 and 71. Other l<ey residues would be 6 if not 
glutamine, 35 if not tryptophan. 62 if not phenylalanine or tryosine, 64, 66, 68, 99 and 101 if not 
glycines and 102 if not a threonine. Residues which make a further contribution are 2, 4, 37, 45 
5 and 47. Finally residues 73 and 21 and 19 may make long distance packing contributions of a 

minor nature. 

2.3. Residues at the variable domain interface between heavy and light chains - In both the light and 
heavy chains most of the non-CDR interface residues are conserved. If a conserved residue is 
replaced by a residue of different character, e.g. size or charge, it should be considered for retention 

10 as the murine residue. 

2.3.1 . Heavy Chain - Residues which need to be considered are 37 if the residue is not a valine but 
is of larger side chain volume or has a charge or polarity. Other residues are 39 if not a glutamine, 
45 if not a leucine, 47 if not a tryptophan. 91 if not a phenylalanine or tyrosine. 93 if not an alanine 
and 103 if not a tryptophan. Residue 89 is also at the interface but is not in a position where the 

75 side chain could be of great impact. 

2.3.2. Light Chain - Residues which need to be considered are 36, if not a tyrosine. 38 if not a 
glutamine. 44 if not a proline, 46. 49 if not a tyrosine, residue 85. residue 87 if not a tyrosine and 
98 if not a phenylalanine. 

2.4. Variable-Constant region interface - The elbow angle between variable and constant regions may 
20 be affected by alterations in packing of key residues in the variable region against the constant region 

which may affect the position of Vl and Vh with respect to one another. Therefore it is worth noting 
the residues likely to be in contact with the constant region. In the heavy chain the surface residues 
potentially in contact with the variable region are conserved between mouse and human antibodies 
therefore the variable region contact residues may influence the V-C interaction. In the light chain the 
25 amino acids found at a number of the constant region contact points vary, and the V & C regions are 

not in such close proximity as the heavy chain. Therefore the influences of ,the light chain V-C 
interface may be minor. 

2.4.1. Heavy Chain - Contact residues are 7. 11, 41, 87, 108. 110, 112. 

2.4.2. Light Chain - In the light chain potentially contacting residues are 10. 12, 40, 80, 83, 103 and 
30 105. 

The above analysis coupled with our considerable practical experimental experience in the CDR- 
grafting of a number of different antibodies have lead us to the protocol given above. 

The present invention Is now described, by way of example only, with reference to the accompanying 
Figures 1-13. 



35 
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Figure 12 


55 


Figure 13 



shows DNA and amino acid sequences of the OKT3 light chain; 
shows DNA and amino acid sequences of the 0KT3 heavy chain; 

shows the alignment of the 0KT3 light variable region amino acid sequence with that of the 
light variable region of the human antibody REI; 

shows the alignment of the OKT3 heavy variable region amino acid sequence with that of 
the heavy variable region of the human antibody KOL; 

shows the heavy variable region amino acid sequences of 0KT3, KOL and various 
corresponding CDR grafts; 

shows the light variable region amino acid sequences of OKT3, REI and various cor- 
responding CDR grafts; 

shows a graph of binding assay results for various grafted 0KT3 antibodies* 
shows a graph of blocking assay results for various grafted 0KT3 antibodies; 
shows a similar graph of blocking assay results; 

shows similar graphs for both binding assay and blocking assay results; 

shows further similar graphs for both binding assay and blocking assay results: 

shows a graph of competition assay results for a minimally grafted OKT3 antibody 

compared with the 0KT3 murine reference standard, and 

shows a similar graph of competition assay results comparing a fully grafted 0KT3 
antibody with the murine reference standard. 
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DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION 
EXAMPLE 1 
5 CDR-GRAFTING OF OKT3 
MATERIAL AND METHODS 

1. INCOMING CELLS 

10 Hybridoma cells producing antibody 0KT3 were provided by Ortho (seedlot 4882.1) and were grown up 
in antibiotic free Dulbecco's Modified Eagles Medium (DMEM) supplemented with glutamine and 5% 
foetal calf serum, and divided to provide both an overgrown supernatant for evaluation and cells for 
extraction of RNA. The overgrown supernatant was shown to contain 250 ug/mL murine lgG2a/kappa 
antibody. The supernatant was negative for murine lambda light chain and IgGi, lgG2b. lgG3. IgA and 

75 IgM heavy chain. 20m L of supernatant was assayed to confirm that the antibody present was 0KT3. 

2. MOLECULAR BIOLOGY PROCEDURES 

Basic molecular biology procedures were as described in Maniatis et al (ref. 9) with, in some cases, 
minor modifications. DNA sequencing was performed as described in Sanger et al (ref. 11) and the 
Amersham International Pic sequencing handbook. Site directed mutagenesis was as described in 
20 Kramer et al (ref. 12) and the Anglian Biotechnology Ltd. handbook. COS cell expression and metabolic 
labelling studies were as described In Whittle et al (ref. 13) 

3. RESEARCH ASSAYS 

3.1. ASSEMBLY ASSAYS Assembly assays were performed on supernatants from transfected COS 
cells to determine the amount of intact IgG present. 

25 3.1.1. COS CELLS TRANSFECTED WITH MOUSE 0KT3 GENES The assembly assay for intact 

mouse IgG in COS cell supernatants was an ELISA with the following format: 
96 well microtitre plates were coated with F(ab')2 goat anti-mouse IgG Fc. The plates were washed 
in water and samples added for 1 hour at room temperature. The plates were washed and F(ab')2 
goat anti-mouse IgG F(ab')2 (HRPO conjugated) was then added. Substrate was added to reveal 

30 the reaction. UPC10, a mouse lgG2a myeloma, was used as a standard. 

3.1.2. COS AND CHO CELLS TRANSFECTED WITH CHIMERIC OR CDR-GRAFTED OKT3 
GENES 

The assembly assay for chimeric or CDR-grafted antibody in COS ceil supernatants was an ELISA 
with the following format: 

35 96 well microtitre plates were coated with F(ab')2 goat anti-human IgG Fc. The plates were 

washed and samples added and incubated for 1 hour at room temperature. The plates were 
washed and monoclonal mouse anti-human kappa chain was added for 1 hour at room tempera- 
ture. 

The plates were washed and F(ab*)2 goat anti-mouse IgG Fc (HRPO conjugated) was added. 
40 Enzyme substrate was added to reveal the reaction. Chimeric B72.3 (lgG4) (ref. 13) was used as a 

standard. The use of a monoclonal anti-kappa chain in this assay allows grafted antibodies to be 
read from the chimeric standard. 

3.2. ASSAY FOR ANTIGEN BINDING ACTIVITY 

Material from COS cell supernatants was assayed for 0KT3 antigen binding activity onto CD3 positive 
45 cells in a direct assay. The procedure was as follows: 

HUT 78 cells (human T cell line, CD3 positive) were maintained in culture. Monolayers of HUT 78 
cells were prepared onto 96 well ELISA plates using poly-L-lysine and glutaraldehyde. Samples were 
added to the monolayers for 1 hour at room temperature. 

The plates were washed gently using PBS. F(ab*)2 goat anti-human IgG Fc (HRPO conjugated) or F- 
50 (ab')2 goat anti-mouse IgG Fc (HRPO conjugated) was added as appropriate for humanised or mouse 

samples. Substrate was added to reveal the reaction. 

The negative control for the cell-based assay was chimeric B72.3. The positive control was mouse 
Orthomune 0KT3 or chimeric 0KT3. when available. This cell-based assay was difficult to perform, 
and an alternative assay was developed for CDR-grafted OKT3 which was more sensitive and easier 
55 to carry out. 

In this system CDR-grafted 0KT3 produced by COS cells was tested for its ability to bind to the CD3- 
positive HPB-ALL (human peripheral blood acute lymphocytic leukemia) cell line. It was also tested for 
its ability to block the binding of murine 0KT3 to these cells. Binding was measured by the following 
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procedure: HPB-ALL cells were harvested from tissue culture. Cells were incubated at 4^C for 1 hour 
with various dilutions of test antibody, positive control antibody, or negative control antibody. The cells 
were washed once and incubated at 4°C for 1 hour with an FITC-labelled goat anti-human IgG (Fc- 
specific. mouse absorbed). The cells were washed twice and analysed by cytofluorography. Chimeric 

5 0KT3 was used as a positive control for direct binding. Cells incubated with mock- transfected COS 

cell supernatant, followed by the FITC-labelled goat anti-human IgG. provided the negative control. To 
test the ability of CDR-grafted 0KT3 to block murine OKT3 binding, the HPB-ALL cells were 
incubated at 4°C for 1 hour with various dilutions of test antibody or control antibody. A fixed 
saturating amount of FITC 0KT3 was added. The samples were incubated for 1 hour at 4°C, washed 

10 twice and analysed by cytofluorography. FlTC-labelled 0KT3 was used as a positive control to 

determine maximum binding. Unlabelled murine 0KT3 served as a reference standard for blocking. 
Negative controls were unstained cells with or without mock-transfected cell supernatant. The ability of 
the CDR-grafted 0KT3 light chain to bind CD3-positive cells and block the binding of murine OKT3 
was initially tested In combination with the chimeric 0KT3 heavy chain. The chimeric 0KT3 heavy 

75 chain is composed of the murine 0KT3 variable region and the human lgG4 constant region. The 

chimeric heavy chain gene is expressed in the same expression vector used for the CDR-grafted 
genes. The CDR-grafted light chain expression vector and the chimeric heavy chain expression vector 
were co-transfected into COS cells. The fully chimeric OKT3 antibody (chimeric light chain and 
chimeric heavy chain) was found to be fully capable of binding to CD3 positive cells and blocking the 

20 binding of murine 0KT3 to these cells. 

3.3 DETERMINATION OF RELATIVE BINDING AFFINITY 

The relative binding affinities of CDR-grafted anti-CD3 monoclonal antibodies were determined by 
competition binding (ref. 6) using the HPB-ALL human T cell line as a source of CD3 antigen, and 
fluorescein-conjugated murine OKT3 (FI-0KT3) of known binding affinity as a tracer antibody. The 

25 binding affinity of FI-0KT3 tracer antibody was determined by a direct binding assay in which 

increasing amounts of FI-0KT3 were incubated with HPB-ALL (5x1 0^) in PBS with 5% foetal calf 
serum for 60 min. at 4°C. Cells were washed, and the fluorescence Intensity was determined on a 
FACScan flow cytometer calibrated with quantitative microbead standards (Flow Cytometry Standards, 
Research Triangle Park, NC). Fluorescence intensity per antibody molecule (F/P ratio) was deter- 

30 mined by using microbeads which have a predetermined number of mouse IgG antibody binding sites 

(Simply Cellular beads, Flow Cytometry Standards). F/P equals the fluorescence intensity of beads 
isaturated with FI-0KT3 divided by the number of binding sites per bead. The amount of bound and 
free FI-0KT3 was calculated from the mean fluorescence intensity per cell, and the ratio of bound/free 
was plotted against the number of moles of antibody bound. A linear fit was used to determine the 

35 affinity of binding (absolute value of the slope). 

For competitive binding, increasing amounts of competitor antibody were added to a sub-saturating 
dose of FI-0KT3 and incubated with 5x1 0^ HPB-ALL in 200 mi of PBS with 5% foetal calf serum, for 
60 min at 4°C. The fluorescence intensities of the cells were measured on a FACScan flow cytometer 
calibrated with quantitative microbead standards. The concentrations of bound and free FI-OKT3 were 

40 calculated. The affinities of competing anti-bodies were calculated from the equation [X]-[0KT3] = 

(1/Kx) - (1/Ka), where Ka is the affinity of murine OKT3, Kx is the affinity of competitor X, [ ] Is the 
concentration of competitor antibody at which bound/free binding is R/2. and R is the maximal 
bound/free binding. 

4. cDNA LIBRARY CONSTRUCTION 

45 4.1. mRNA PREPARATION AND cDNA SYNTHESIS 

OKT3 producing cells were grown as described above and 1.2 x 10^ cells harvested and mRNA 
extracted using the guanidinlum/LiCI extraction procedure. cDNA was prepared by priming from Oligo- 
dT to generate full length cDNA. The cDNA was methylated and EcoRI linkers added for cloning. 
4.2. LIBRARY CONSTRUCTION 

50 The cDNA library was ligated to pSP65 vector DNA which had been EcoRI cut and the 5' phosphate 

groups removed by calf intestinal phosphatase (EcoRI /CIP). The ligation was used to transform high 
transformation efficiency Escherichia coli (E.coli) HB101. A cDNA library was prepared. 3600 colonies 
were screened for the light chain and 10000 colonies were screened for the heavy chain. 

5. SCREENING 

55 E.coli colonies positive for either heavy or light chain probes were identified by oligonucleotide screening 
using the oligonucleotides: 5' TCCAGATGTTAACTGCTCAC for the light chain, which is complementary 
to a sequence in the mouse kappa constant region, and 5' CAGGGGCCAGTGGATGGATAGAC for the 
heavy chain which is complementary to a sequence in the mouse lgG2a constant CHI domain region. 12 
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light chain and 9 heavy chain clones were Identified and taken for second round screening. Positive 
clones from the second round of screening were grown up and DNA prepared. The sizes of the gene 
inserts were estimated by gel electrophoresis and inserts of a size capable of containing a full length 
cDNA were subcloned into Ml 3 for DNA sequencing. 

5 6. DNA SEQUENCING 

Clones representing four size classes for both heavy and light chains were obtained in M13. DNA 
sequence for the 5' untranslated regions, signal sequences, variable regions and 3' untranslated regions 
of full length cDNAs [Figures 1(a) and 2(a)] were obtained and the corresponding amino acid sequences 
predicted [(Figures 1(b) and 2(b)], In Figure 1(a) the untranslated DNA regions are shown in uppercase, 

70 and in both Figures 1 and 2 the signal sequences are underlined. 

7. CONSTRUCTION OF cDNA EXPRESSION VECTORS 

Celltech expression vectors are based on the plasmid pEEShCIS/IV (ref. 14). A polylinker for the insertion 
of genes to be expressed has been introduced after the major immediate early promoter/enhancer of the 
human Cytomegalovirus (hCMV). Marker genes for selection of the plasmid in transfected eukaryotic 

15 cells can be inserted as BamHI cassettes in the unique BamHI site of pEE6 hCMV; for instance, the 
neo marker to provide pEE6 hCMV neo. It is usual practice to insert the neo and gpt markers prior to 
insertion of the gene of interest, whereas the GS marker is inserted last because of the presence of 
internal EcoRI sites in the cassette. 

The selectable markers are expressed from the SV40 late promoter which also provides an origin of 
20 replication so that the vectors can be used for expression in the COS cell transient expression system. 

The mouse sequences were excised from the M13 based vectors described above as EcoR1 fragments 
and cloned into either pEE6-hCMV-neo for the heavy chain and into EE6-hCMV-gpt for the light chain to 
yield vectors pJA136 and pJA135 respectively. 

8. EXPRESSION OF cDNAS IN COS CELLS 

25 Plasmids pJA135 and pJA136 were co-transf acted into COS cells and supernatant from the transient 
expression experiment was shown to contain assembled antibody which bound to T-cell enriched 
lymphocytes. Metabolic labelling experiments using ^^S methionine showed expression and assembly of 
heavy and light chains. 

9. CONSTRUCTION OF CHIMERIC GENES 

30 Construction of chimeric genes followed a previously described strategy [Whittle et al (ref. 13)]. A 
restriction site near the 3' end of the variable domain sequence is identified and used to attach an 
oligonucleotide adapter coding for the remainder of the mouse variable region and a suitable restriction 
site for attachment to the constant region of choice. 
9.1. LIGHT CHAIN GENE CONSTRUCTION 
35 The mouse light chain cDNA sequence contains an Aval site near the 3* end of the variable region 

[Fig. 1(a)]. The majority of the sequence of the variable region was isolated as a 396 bp, EcoR1-Aval 
fragment. An oligonucleotide adapter was designed to replace the remainder of the 3' region of the 
variable region from the Aval site and to include the 5' residues of the human constant region up to 
and including a unique Narl site which had been previously engineered into the constant region. 
40 A Hindi! 1 site was introduced to act as a marker for insertion of the linker. 

The linker was ligated to the Vl fragment and the 413 bp EcoRI -Narl adapted fragment was purified 
from the ligation mixture. 

The constant region was isolated as an Narl -BamHI fragment from an Ml 3 clone NW361 and was 
ligated with the variable region DNA into an EcoRI /Bam HI /CI P pSP65 treated vector in a three way 
45 reaction to yield plasmid JA143. Clones were isolated after transformation into E.coli and the linker 

and junction sequences were confirmed by the presence of the Hindi 11 site and by DNA sequencing. 
9.2 LIGHT CHAIN GENE CONSTRUCTION - VERSION 2 

The construction of the first chimeric light chain gene produces a fusion of mouse and human amino 
acid sequences at the variable-constant region junction. In the case of the 0KT3 light chain the amino 
50 acids at the chimera junction are: 

Leu-Glu-Ile-* Asn-Arq/ -/Thr -Val-Ala *Ala 

VARIABLE COHSTANT 

55 

This arrangement of sequence introduces a potential site for Asparagine (Asn) linked (N-linked) 
glycosylation at the V-C junction. Therefore, a second version of the chimeric light chain 
oligonucleotide adapter was designed in which the threonine (Thr), the first amino acid of the human 

13 
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constant region, was replaced with the equivalent amino acid from the mouse constant region. Alanine 
(Ala). 

An internal Hindi 11 site was not included in this adapter, to differentiate the two chimeric light chain 
genes. 

5 The variable region fragment was isolated as a 376 bp EcoRI-Aval fragment. The oligonucleotide linker 
was ligated to Narl cut pNW361 and then the adapted 396bp constant region was isolated after recutting 
the modified pNW361 with EcoRI. The variable region fragment and the modified constant region 
fragment were ligated directly into EcoRI /C1P treated pEE6hCMVneo to yield pJA137. Initially all clones 
examined had the insert In the Incorrect orientation. Therefore, the insert was re-isolated and recloned to 
10 turn the insert round and yield plasmid pJA141. Several clones with the insert in the correct orientation 
were obtained and the adapter sequence of one was confirmed by DNA sequencing 
9.3. HEAVY CHAIN GENE CONSTRUCTION 

9.3.1. CHOICE OF HEAVY CHAIN GENE ISOTYPE 
The constant region isotype chosen for the heavy chain was human lgG4. 
15 9.3.2. GENE CONSTRUCTION 

The heavy chain cDNA sequence showed a Bani site near the 3* end of the variable region [Fig. 2(a)- 
]. The majority of the sequence of the variable region was isolated as a 426bp. EcoRI /C1P/Ban1 
fragment. An oligonucleotide adapter was designated to replace the remainder of the 3' region of the 
variable region from the Bani site up to and including a unique HIndlll site which had been previously 
20 engineered into the first two amino acids of the constant region. 

The linker was ligated to the Vh fragment and the EcoRI -Hindi 11 adapted fragment was purified from 
the ligation mixture. The variable region was ligated to the constant region by cutting pJA91 with 
EcoRI and HIndlll removing the intron fragment and replacing it with the Vh to yield pJA142. Clones 
were isolated after transformation into E.coll JM101 and the linker and junction sequences were 
25 confirmed by DNA sequencing. (N.B. The Hindi 11 site is lost on cloning). 

10. CONSTRUCTION OF CHIMERIC EXPRESSION VECTORS 
10.1. neo AND gpt VECTORS 

The chimeric light chain (version 1) was removed from pJA143 as an EcoRI fragment and cloned into 
EcoR1/C1P treated pEE6hCMVneo expression vector to yield pJA145. Clones with the insert in the 
30 correct orientation were identified by restriction mapping. 

The chimeric light chain (version 2) was constructed as described above. 

The chimeric heavy chain gene was isolated from pJAl42 as a 2.5Kbp EcoRI /BamHI fragment and 
cloned into the EcoRI /Bell /CI P treated vector fragment of a derivative of pEEShCMVgpt to yield 
plasmid pJA144. 

35 10.2. GS SEPARATE VECTORS 

GS versions of pJA141 and pJA144 were constructed by replacing the neo and gpt cassettes by a 
BamHI /Sa11 /CI P treatment of the plasmids, isolation of the vector fragment and ligation to a GS- 
containing fragment from the plasmid pR049 to yield the light chain vector pJA179 and the heavy 
chain vector pJAI 80. 

40 10.3. GS SINGLE VECTOR CONSTRUCTION 

Single vector constructions containing the cL (chimeric light), cH (chimeric heavy) and GS genes on 
one plasmid In the order cL-cH-GS, or cH-cL-GS and with transcription of the genes being head to tail 
e.g. cL>cH>GS were constructed. These plasmids were made by treating pJA179 or pJA180 with 
BamHI/CIP and ligating in a BgI11/Hind111 hCMV promoter cassette along with either the 

45 Hindi 11/BamH1 fragment from pJA141 into pJAISO to give the cH-cL-GS plasmid pJA182 or the 

Hindi 11/BamH1 fragment from pJA144 into pJA179 to give the cL-cH-GS plasmid pJAl81. 

11. EXPRESSION OF CHIMERIC GENES 
11.1. EXPRESSION IN COS CELLS 

The chimeric antibody plasmid pJA145 (cL) and pJA144 (cH) were co-transfected Into COS cells and 
50 supernatant from the transient expression experiment was shown to contain assembled antibody 

which bound to the HUT 78 human T-cell line. Metabolic labelling experiments using ^^S methionine 
showed expression and assembly of heavy and tight chains. However the light chain mobility seen on 
reduced gels suggested that the potential glycosylation site was being glycosylated. Expression in 
COS cells in the presence of tunicamycin showed a reduction in size of the light chain to that shown 
55 for control chimeric antibodies and the OKT3 mouse light chain. Therefore JA141 was constructed 

and expressed. In this case the light chain did not show an aberrant mobility or a size shift In the 
presence or absence of tunicamycin. This second version of the chimeric light chain, when expressed 
in association with chimeric heavy (cH) chain, produced antibody which showed good binding to HUT 
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78 cells, tn both cases antigen binding was equivalent to that of the nnouse antibody. 

11.2 EXPRESSION IN CHINESE HAMSTER OVARY (CHO) CELLS Stable cell lines have been 

prepared from plasnrtids PJA141/pJA144 and from pJAI 79/p JA1 80. pJA181 and pJA182 by transfec- 

tion into CHO cells. 
5 12. CDR-GRAFTING 

The approach taken was to try to introduce sufficient mouse residues into a human variable region 
framework to generate antigen binding activity comparable to the mouse and chimeric antibodies. 

12.1. VARIABLE REGION ANALYSIS 

From an examination of a small database of structures of antibodies and antigen-antibody complexes 
10 it is clear that only a small number of antibody residues make direct contact with antigen. Other 

residues may contribute to antigen binding by positioning the contact residues in favourable configu- 
rations and also by inducing a stable packing of the individual variable domains and stable interaction 
of the light and heavy chain variable domains. 

The residues chosen for transfer can be identified in a number of ways: 
75 (a) By examination of antibody X-ray crystal structures the antigen binding surface can be 

predominantly located on a series of loops, three per domain, which extend from the B-barrel 
framework. 

(b) By analysis of antibody variable domain sequences regions of hypervariability [termed the 
Complementarity Determining Regions (CDRs) by Wu and Kabat (ref. 5)] can be identified. In the 

20 most but not all cases these CDRs correspond to. but extend a short way beyond, the loop regions 

noted above. 

(c) Residues not identified by (a) and (b) may contribute to antigen binding directly or indirectly by 
affecting antigen binding site topology, or by inducing a stable packing of the individual variable 
domains and stabilising the Inter- variable domain interaction. These residues may be identified 

25 either by superimposing the sequences for a given antibody on a known structure and looking at 

key residues for their contribution, or by sequence alignment analysis and noting "idiosyncratic" 
residues followed by examination of their structural location and likely effects. 

12.1.1. LIGHT CHAIN 

Figure 3 shows an alignment of sequences for the human framework region RE1 and the 0KT3 

30 light variable region. The structural loops (LOOP) and CDRs (KABAT) believed to correspond to the 

antigen binding region are marked. Also marked are a number of other residues which may also 
contribute to antigen binding as described in 13.1(c). Above the sequence in Figure 3 the residue 
type indicates the spatial location of each residue side chain, derived by examination of resolved 
structures from X-ray crystallography analysis. The key to this residue type designation is as 

36 follows: 

N - near to CDR (From X-ray Structures) 
P - Packing B - Buried Non-Packing 
S - Surface E - Exposed 
I - Interface * - Interface 

40 ' Packing/Part Exposed 

? - Non-CDR Residues which may require to be left as Mouse sequence. Residues underlined In 
Figure 3 are amino acids. RE1 was chosen as the human framework because the light chain is a 
kappa chain and the kappa variable regions show higher homology with the mouse sequences than 
a lambda light variable region, e.g. KOL (see below). RE1 was chosen in preference to another 

45 kappa light chain because the X-ray structure of the light chain has been determined so that a 

structural examination of individual residues could be made. 

12.1.2. HEAVY CHAIN 

Similariy Figure 4 shows an alignment of sequences for the human framework region KOL and the 
0KT3 heavy variable region. The structural loops and CDRs believed to correspond to the antigen 

50 binding region are marked. Also marked are a number of other residues which may also contribute 

to antigen binding as described in 12.1(c). The residue type key and other indicators used in 
Figure 4 are the same as those used in Figure 3. KOL was chosen as the heavy chain framework 
because the X-ray structure has been determined to a better resolution than, for example, NEWM 
and also the sequence alignment of OKT3 heavy variable region showed a slightly better homology 

65 to KOL than to NEWM. 

12.2. DESIGN OF VARIABLE GENES 

The variable region domains were designed with mouse variable region optimal codon usage 
[Grantham and Perrin (ref. 15)] and used the B72.3 signal sequences [Whittle et al (ref. 13)]. The 
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sequences were designed to be attached to the constant region in the same way as for the chimeric 
genes described above. Some constructs contained the "Kozak consensus sequence" [Kozal< (ref. 
16)] directly linked to the 5' of the signal sequence in the gene. This sequence motif is believed to 
have a beneficial role in translation initiation in eukaryotes. 

5 12,3. GENE CONSTRUCTION 

To build the variable regions, various strategies are available. The sequence may be assembled by 
using oligonucleotides in a manner similar to Jones et al (ref. 17) or by simultaneously replacing all of 
the CDRs or loop regions by oligonucleotide directed site specific mutagenesis in a manner similar to 
Verhoeyen et al (ref. 2). Both strategies were used and a list of constructions is set out in Tables 1 

70 and 2 and Figures 4 and 5. It was noted in several cases that the mutagenesis approach led to 

deletions and rearrangements in the gene being remodelled, while the success of the assembly 
approach was very sensitive to the quality of the oligonucleotides. 
13. CONSTRUCTION OF EXPRESSION VECTORS 

Genes were isolated from Ml 3 or SP65 based intermediate vectors and cloned into pEE6hCMVneo for 
15 the light chains and pEE6hCMVgpt for the heavy chains in a manner similar to that for the chimeric 
genes as described above. 
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TABLE 1 CDR-CRAFTED GENE CONSTRUCTS 

CODE HOUSE SEQUENCE HETHOD OF KOZAK 

CONTENT CONSTRUCTION SEQUENCE 



10 



15 



20 



LIGHT 


CHAIN 


ALL 


, HUMAN 


FRAKEUORK RXl 












121 


26-32. 


50- 


56. 


91- 


96 


inclusive 


SDK snd 


gene 


assenbly 




n 


12 lA 


26-32. 


50- 


56, 


91- 


96 


inclusive 


Partial 


gene 


assenbly 


n.d. 


♦ 




-H, 3. 


46. 


47 


















121B 


26-32. 


50. 


56. 


91- 


96 


inclusive 


Partial 


gene 


assembly 


n.d. 






+ 46, 


47 




















221 


24-24. 


50- 


56. 


91- 


96 


inclusive 


Partial 


gene 


assembly 






221A 


24-34. 


50- 


56. 


91- 


96 


inclusive 


Partial 


gene 


assembly 


+ 






+1. 3. 


46. 


47 


















2215 


24-34, 


50- 


56. 


91- 


96 


inclusive 


Partial 


gene 


assembly 








■fl. 3 






















221C 


24-34, 


50- 


56. 


91- 


96 


inclusive 


Partial 


gene 


assembly 







25 


HEAVY 


CHAIN 


ALL HUMAN FRAHEVORX KOL 








121 


26-32. 


50-56, 


95-lOOB inclusive 


Gene assembly 


n.d. 


+ 




131 


26-32. 


50-58. 


95-lOOB inclusive 


Gene assembly 


n.d. 






141 


26-32. 


50-65, 


95-lOOB inclusive 


Partial gene assembly 


+ 


n.d. 


30 


321 


26-35. 


50-56. 


95-lOOB inclusive 


Partial gene assembly 


+ 


n.d. 




331 


26-35, 


50-58. 


95-lOOB inclusive 


Partial gene assembly 
Gene assembly 








341 


26-35, 


50-65, 


95-lOOB inclusive 


SDH 


+ 




35 










Partial gene assembly 




+ 




341A 


26-35, 


50-65. 


95-lOOB inclusive 


Gene assembly 


n.d. 


+ 






+6. 23. 


24. 48. 49, 71. 73. 76. 












78. 88. 


91 (+63 human) 








40 


341B 


26-35. 


50-65, 


95-lOOB inclusive 


Gene assembly 


n.d. 


+ 






48. 49. 71. 


73. 76. 78, 88. 91 












(+63 ♦ 


human) 











45 



50 



KEY 

n.d. no c done 

SDH Sice directed mutagenesis 

Gene assembly Variable region assembled entirely from oligonucleotides 
Partial gene Variable region assembled by combination of restriction 
assembly fragments either from other genes originally created by SDM 

and gene assembly or by oligonucleotide assembly of part of 
the variable region and reconstruction vith restriction 
fragments from other genes originally created by SDH and gene 
assembly 



14. EXPRESSION OF CDR-GRAFTED GENES 

14.1. PRODUCTION OF ANTIBODY CONSISTING OF GRAFTED LIGHT (gL) CHAINS WITH MOUSE 

HEAW (mH) OR CHIMERIC HEAVY (cH) CHAINS 

All gL chains, in association with mH or cH produced reasonable amounts of antibody. Insertion of the 
Kozak consensus sequence at a position 5' to the ATG (kgL constructs) however, led to a 2-5 fold 
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improvement in net expression. Over an extended series of experiments expression levels were raised 
from approximately 200ng/ml to approximately 500 ng/ml for kgL/cH or kgl_/mH combinations. 
When direct binding to antigen on HUT 78 cells was measured, a construct designed to include 
mouse sequence based on loop length (gL121) did not lead to active antibody in association with mH 

5 or cH. A construct designed to include mouse sequence based on Kabat CDRs (gL221) demonstrated 

some weak binding in association with mH or cH. However, when framework residues 1, 3, 46, 47 
were changed from the human to the murine 0KT3 equivalents based on the arguments outlined in 
Section 12.1 antigen binding was demonstrated when both of the new constructs, which were termed 
121 A and 221 A were co-expressed with cH. When the effects of these residues were examined in 

10 more detail, it appears that residues 1 and 3 are not major contributing residues as the product of the 

gL221B gene shows little detectable binding activity in association with cH. The light chain product of 
gL22lC, in which mouse sequences are present at 46 and 47, shows good binding activity in 
association with cH. 

14.2 PRODUCTION OF ANTIBODY CONSISTING OF GRAFTED HEAVY (gH) CHAINS WITH MOUSE 
75 LIGHT (mL) OR CHIMERIC LIGHT (cL) CHAINS 

Expression of the gH genes proved to be more difficult to achieve than for gL. First, inclusion of the 
Kozak sequence appeared to have no marked effect on expression of gH genes. Expression appears 
to be slightly improved but not to the same degree as seen for the grafted light chain. 
Also, It proved difficult to demonstrate production of expected quantities of material when the loop 
20 choice (amino acid 26-32) for CDR1 is used, e.g. gH121. 131, 141 and no conclusions can be drawn 

about these constructs. 

Moreover, co-expression of the gH341 gene with cL or mL has been variable and has tended to 
produce lower amounts of antibody than the cH/cL or mH/mL combinations. The alterations to gH341 
to produce gH341A and gH341B lead to improved levels of expression. 
25 This may be due either to a general increase in the fraction of mouse sequence in the variable region, 

or to the alteration at position 63 where the residue is returned to the human amino acid Valine (Val) 
from Phenylalanine (Phe) to avoid possible internal packing problems with the rest of the human 
framework. This arrangement also occurs in gH331 and gH321 . 

When gH321 or gH331 were expressed in association with cL, antibody was produced but antibody 

30 binding activity was not detected. 

When the more conservative gH341 gene was used antigen binding could be detected in association 
with cL or mL, but the activity was only marginally above the background level. When further mouse 
residues were substituted based on the arguments in 12.1, antigen binding could be clearly 
demonstrated for the antibody produced when kgH341A and kgH341B were expressed in association 

35 with cL. 

14.3 PRODUCTION OF FULLY CDR-GRAFTED ANTIBODY 

The kgL221A gene was co-expressed with kgH341, kgH341A or kgH341B. For the combination 
kgH221A/kgH341 very little material was produced in a normal COS cell expression. 
For the combinations kgL221A/kgH341A or kgH221A/kgH341B amounts of antibody similar to gL/cH 
40 was produced. 

In several experiments no antigen binding activity could be detected with kgH221 A/gH341 or 
kgH221 A/kgH341 combinations, although expression levels were very low. 

Antigen binding was detected when kgL221 A/kgH341A or kgH221 A/kgH341B combinations were 
expressed. In the case of the antibody produced from the kgL221A/kgH341 A combination the antigen 
45 binding was very similar to that of the chimeric antibody. 

An analysis of the above results is given below. 
15. DISCUSSION OF CDR-GRAFTING RESULTS 

In the design of the fully humanised antibody the aim was to transfer the minimum number of mouse 
amino acids that would confer antigen binding onto a human antibody framework. 
50 15.1. LIGHT CHAIN 

15.1.1. EXTENT OF THE CDRs 

For the tight chain the regions defining the loops known from structural studies of other antibodies 
to contain the antigen contacting residues, and those hypervariable sequences defined by Kabat et 
al (refs. 4 and 5) as Complementarity Determining Regions (CDRs) are equivalent for CDR2. For 
55 CDR1 the hypervariable region extends from residues 24-34 inclusive while the structural loop 

extends from 26-32 inclusive, in the case of 0KT3 there is only one amino acid difference between 
the two options, at amino acid 24, where the mouse sequence is a serine and the human 
framework RE1 has glutamine. For CDR3 the loop extends from residues 91-96 inclusive while the 
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Kabat hypervariability extends from residues 89-97 inclusive. For 0KT3 annino acids 89, 90 and 97 
are the same between 0KT3 and RE1 (Fig. 3), When constructs based on the loop choice for 
CDR1 (gL121) and the Kabat choice (gL221) were made and co-expressed with mH or cH no 
evidence for antigen binding activity could be found for gL121, but trace activity could be detected 
5 for the gL221, suggesting that a single extra mouse residue in the grafted variable region could 

have some detectable effect. Both gene constructs were reasonably well expressed in the transient 
expression system. 
15.1.2. FRAMEWORK RESIDUES 

The remaining framework residues were then further examined, in particular amino acids known 
10 from X-ray analysis of other antibodies to be close to the CDRs and also those amino acids which 

in OKT3 showed differences from the consensus framework for the mouse subgroup (subgroup VI) 
to which OKT3 shows most homology. Four positions 1, 3, 46 and 47 were identified and their 
possible contribution was examined by substituting the mouse amino acid for the human amino 
acid at each position. Therefore gL221A (gL221 + D1Q, Q3V. L46R. L47W. see Figure 3 and 
75 Table 1) was made, cloned in EE6hCMVneo and co-expressed with cH (pJA144). The resultant 

antibody was well expressed and showed good binding activity. When the related genes gL221B 
(gL221 + D1Q, Q3V) and gL221C (gL221 + L46R. L47W) were made and similarly tested, while 
both genes produced antibody when co-expressed with cH, only the gL221C/cH combination 
showed good antigen binding. When the gLl2lA (gL121 + D1Q. Q3V. L46R. L47W) gene was 
20 made and co-expressed with cH, antibody was produced which also bound to antigen. 

15.2. HEAVY CHAIN 

15.2.1. EXTENT OF THE CDRs 

For the heavy chain the loop and hypervariability analyses agree only in CDR3. For CDR1 the loop 
region extends from residues 26-32 inclusive whereas the Kabat CDR extends from residues 31-35 

25 inclusive. For CDR2 the loop region is from 50-58 inclusive while the hypervariable region covers 

amino acids 50-65 inclusive. Therefore humanised heavy chains were constructed using the 
framework from antibody KOL and with various combinations of these CDR choices, including a 
shorter choice for CDR2 of 50-56 inclusive as there was some uncertainty as to the definition of the 
end point for the CDR2 loop around residues 56 to 58. The genes were co-expressed with mL or 

30 cL initially. In the case of the gH genes with loop choices for CDR1 e.g. gH121. gH131, gH141 very 

little antibody was produced in the culture supernatants. As no free light chain was detected it was 
presumed that the antibody was being made and assembled inside the cell but that the heavy 
chain was aberrant in some way, possibly incorrectly folded, and therefore the antibody was being 
degraded internally. In some experiments trace amounts of antibody could be detected in ^^8 

35 labelling studies. 

As no net antibody was produced, analysis of these constructs was not pursued further. 
When, however, a combination of the loop choice and the Kabat choice for CDR1 was tested 
(mouse amino acids 26-35 inclusive) and in which residues 31 (Ser to Arg). 33 (Ala to Thr), and 35 
(Tyr to His) were changed from the human residues to the mouse residue and compared to the 

40 first series, antibody was produced for gH321, kgH331 and kgH341 when co-expressed with cL. 

Expression was generally low and could not be markedly improved by the insertion of the Kozak 
consensus sequence 5' to the ATG of the signal sequence of the gene, as distinct from the case of 
the gL genes where such insertion led to a 2-5 fold increase in net antibody production. However, 
only in the case of gH341/mL or kgH341/cL could marginal antigen binding activity be dem- 

45 onstrated. When the kgH341 gene was co-expressed with kgL221A, the net yield of antibody was 

too low to give a signal above the background level In the antigen binding assay. 

15.2.2. FRAMEWORK RESIDUES 

As in the case of the light chain the heavy chain frameworks were re-examined. Possibly because 
of the lower initial homology between the mouse and human heavy variable domains compared to 
50 the light chains, more amino acid positions proved to be of interest. Two genes kgH341A and 

kgH341B were constructed, with 11 or 8 human residues respectively substituted by mouse 
residues compared to gH341, and with the CDR2 residue 63 returned to the human amino acid 
potentially to improve domain packing. Both showed antigen binding when combined with cL or 
kgL221A. the kgH341A gene with all 11 changes appearing to be the superior choice. 
55 15.3 INTERIM CONCLUSIONS 

It has been demonstrated, therefore, for OKT3 that to transfer antigen binding ability to the humanised 
antibody, mouse residues outside the CDR regions defined by the Kabat hypervariability or structural 
loop choices are required for both the light and heavy chains. Fewer extra residues are needed for the 
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light chain, possibly due to the higher initial homology between the mouse and human kappa variable 

regions. 

Of the changes seven (1 and 3 from the light chain and 6, 23. 71 , 73 and 76 from the heavy chain) 
are predicted from a knowledge of other antibody structures to be either partly exposed or on the 

5 antibody surface. It has been shown here that residues 1 and 3 in the light chain are not absolutely 

required to be the mouse sequence; and for the heavy chain the gH341 B heavy chain in combination 
with the 221 A light chain generated only weak binding activity. Therefore the presence of the 8. 23 
and 24 changes are important to maintain a binding affinity similar to that of the murine antibody. It 
was important, therefore, to further study the individual contribution of othe other 8 mouse residues of 

10 the kgH341A gene compared to kgH341. 

16. FURTHER CDR-GRAFTING EXPERIMENTS 

Additional CDR-grafted heavy chain genes were prepared substantially as described above. With 
reference to Table 2 the further heavy chain genes were based upon the gh341 (plasmid pJA178) and 
gH341A (plasmid pJA185) with either mouse 0KT3 or human KOL residues at 6. 23. 24. 48. 49, 63. 71, 

75 73. 76, 78, 88 and 91, as indicated. The CDR-grafted light chain genes used in these further experiments 
were gL221. gL221A, gL221B and gL221C as described above. 
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TABLE 2 



0KT3 HEAVY CHAIN CDR CRAFTS 
1. ^341 and derivatives 
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0KT3 LIGHT CHAIN CDR CRAFTS 



2. gL221 and derivatives 
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The CDR-grafted heavy and light chain genes were co-expressed in COS cells either with one another 
55 in various combinations but also with the corresponding murine and chimeric heavy and light chain genes 
substantially as described above. The resultant antibody products were then assayed in binding and 
blocking assays with HPB-ALL cells as described above. 



21 



EP000620276 ffile:/A\clcwas03\firmdata\lp\FolevPat\PatentDocuments\£P000620276.CPCl 



Pag e 22 of 66 



EP 0 620 276 A1 



The results of the assays for various grafted heavy chains co-expressed with the gL221C light chain are 
given in Figures 7 and 8 (for the JA184, JA185, JA197 and JA198 constructs - see Table 2), in Figure 9 (for 
the JA183. JA184, JA185 and JA197 constructs) in Figure 10 (for the chimeric, JA185, JA199, JA204, 
JA205, JA207, JA208 and JA209 constructs) and in Figure 11 (for the JA183, JA1B4. JA185. JA198. JA203. 

5 JA205 and JA206 constructs). 

The basic grafted product without any human to nnurine changes in the variable franneworks, i.e. gL221 
co-expressed with gh341 (JA178), and also the "fully grafted" product, having most human to murine 
changes in the grafted heavy chain framework, i.e. gL221C co-expressed with gh341A (JA185), were 
assayed for relative binding affinity in a competition assay against murine 0KT3 reference standard, using 

10 HPB-ALL cells. The assay used was as described above in section 3.3. The results obtained are given in 
Figure 12 for the basic grafted product and in Figure 13 for the fully grafted product. These results indicate 
that the basic grafted product has negllbible binding ability as compared with the OKT3 murine reference 
standard; whereas the "fully grafted" product has a binding ability very similar to that of the OKT3 murine 
reference standard. 

75 The binding and blocking assay results indicate the following: 

The JA198 and JA207 constructs appear to have the best binding characteristics and similar binding 
abilities, both substantially the same as the chimeric and fully grafted gH341A products. This indicates that 
positions 88 and 91 and position 76 are not highly critical for maintaining the 0KT3 binding ability; whereas 
at least some of positions 6, 23. 24, 48, 49, 71 . 73 and 78 are more important. 

20 This is borne out by the finding that the JA209 and JA199. although of similar binding ability to one 
another, are of lower binding ability than the JA198 and JA207 constructs. This indicates the importance of 
having mouse residues at positions 71, 73 and 78, which are either completely or partially human in the 
JA199 and JA209 constructs respectively. 

Moreover, on comparing the results obtained for the JA205 and JA183 constructs it is seen that there is 

25 a decrease in binding going from the JA205 to the JA183 constructs. This indicates the importance of 
retaining a mouse residue at position 23, the only position changed between JA205 and JA1 83. 

These and other results lead us to the conclusion that of the 1 1 mouse framework residues used in the 
gH341A (JA185) construct, it is important to retain mouse residues at all of positions 6, 23, 24, 48 and 49, 
and possibly for maximum binding affinity at 71 , 73 and 78. 

30 Similar Experiments were carried out to CDR-graft a number of the rodent antibodies including 
antibodies having specificity for CD4 (OKT4). lCAM-1 (R6-5). TAG72 (B72.3). and TNFa(61E71, 101.4, 
hTNFI, hTNF2 and hTNF3). 

EXAMPLE 2 

35 

CDR-GRAFTING OF A MURINE ANTI-CD4 T CELL RECEPTOR ANTIBODY, 0KT4A 

Anti 0KT4A CDR-grafted heavy and light chain genes were prepared, expressed and tested substan- 
tially as described above in Example 1 for CDR-grafted 0KT3. The CDR grafting of 0KT4A is described in 

40 detail in Ortho patent application PCT/GB 90 of even date herewith entitled "Humanised Antibodies". 

The disclosure of this Ortho patent application PCT/GB 90 is incorporated herein by reference. A 

number of CDR-grafted 0KT4 antibodies have been prepared. Presently the CDR-grafted 0KT4A of choice 
is the combination of the grafted light chain LCDR2 and the grafted heavy chain HCDR10. 

46 THE LIGHT CHAIN 

The human acceptor framework used for the grafted light chains was RE1. The preferred LCDR2 light 
chain has human to mouse changes at positions 33, 34, 38, 49 and 89 in addition to the structural loop 
CDRs. Of these changed positions, positions 33, 34 and 89 fall within the preferred extended CDRs of the 

50 present invention (positions 33 and 34 in CDR1 and position 89 in CDR3). 

The human to murine changes at positions 38 and 49 corresponds to positions at which the amino acid 
residues are preferably donor murine amino acid residues in accordance with the present invention. 
A comparison of the amino acid sequences of the donor murine light chain variable domain and the RE1 
human acceptor light chain variable further reveals that the murine and human residues are identical at all 

55 of positions 46, 48 and 71 and at all of positions 2, 4, 6, 35. 36, 44. 47. 62. 64-69. 85. 87. 98, 99 and 101 
and 102. However the amino acid residue at position 58 in LCDR2 is the human RE1 framework residue not 
the mouse 0KT4 residue as would be preferred in accordance with the present invention. 
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THE HEAVY CHAIN 



10 



IS 



The human acceptor framework used for the grafted heavy chains was KOL. 
The preferred COR graft HCDR10 heavy chain has human to mouse changes at positions 24. 35. 57. 58, 
60, 88 and 91 in addition to the structural loop CDRs. 

Of these positions, positions 35 (CDR1) and positions 57. 58 and 60 (CDR2) fall within the preferred 
extended CDRs of the present invention. Also the human to mouse change at position 24 corresponds to a 
position at which the amino acid residue is a donor murine residue in accordance with the present invention. 
Moreover, the human to mouse changes at positions 88 and 91 correspond to positions at which the amino 
acid residues are optionally donor murine residues. 

Moreover, a comparison of the murine OKT4A and human KOL heavy chain variable amino acid sequences 
reveals that the murine and human residues are identical at all of positions 23. 49, 71 , 73 and 78 and at all 
of positions 2, 4, 6, 25. 36, 37, 39. 47, 48. 93. 94. 103. 104. 106 and 107. 

Thus the 0KT4A CDR-grafted heavy chain HCDR10 corresponds to a particularly preferred embodiment 

according to the present invention. 



EXAMPLE 3 



20 



25 



CDR-GRAFTING OF AN ANTI-MUCIN SPECIFIC MURINE ANTIBODY. B72.3 

The cloning of the genes coding for the anti-mucin specific murine monoclonal antibody B72.3 and the 
preparation of B72.3 mouse-human chimeric antibodies has been described previously (ref. 13 and WO 
89/01783). CDR-grafted versions of B72.3 were prepared as follows, 
(a) B72.3 Light Chain 

CDR-grafting of this light chain was accomplished by direct transfer of the murine CDRs into the 
framework of the human light chain RE1 . The regions transferred were: 



30 



CDR 


Residues 


Number 




1 


24-34 


2 


50-56 


3 


90-96 



35 



40 



45 



SO 



55 



The activity of the resulting grafted light chain was assessed by co-expression in COS cells, of genes for 
the combinations: 
B72.3 cH/B72,3 cL 
and B72.3 cH/B72.3 gL 

Supernatants were assayed for antibody concentration and for the ability to bind to microtitre plates 
coated with mucin. The results obtained indicated that, in combination with the B72.3 cH chain, B72.3 cL 
and B72.3 gL had similar binding properties. 

Comparison of the murine B72.3 and REI light chain amino acid sequences reveals that the residues 
are identical at positions 46, 58 and 71 but are different at position 48. Thus changing the human residue to 
the donor mouse residue at position 48 may further improve the binding characteristics of the CDR-grafted 
Light chain, (872.3 gL) in accordance with the present invention, 
(b) B72.3 heavy chain 
i. Choice of framework 

At the outset it was necessary to make a choice of human framework. Simply put. the question was as 
follows: Was it necessary to use the framework regions from an antibody whose crystal structure was 
known or could the choice be made on some other criteria? 

For B72.3 heavy chain, it was reasoned that, while knowledge of structure was important, transfer of 
the CDRs from mouse to human frameworks might be facilitated If the overall homology between the 
donor and receptor frameworks was maximised. Comparison of the B72.3 heavy chain sequence with 
those in Kabat (ref. 4) for human heavy chains showed clearly that B72.3 had poor homology for KOL 
and NEWM (for which crystal structures are available) but was very homologous to the heavy chain 
for EU. 

On this basis. EU was chosen for the CDR-grafting and the following residues transferred as CDRs. 
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CDR 


Residues 


Number 




1 


27-36 


2 


50-63 


3 


93-102 



10 



IS 



20 



25 



30 



35 



40 



45 



Also it was noticed that tlie FR4 region of EU was unlike that of any other human (or mouse) antibody. 
Consequently, in the grafted heavy chain genes this was also changed to produce a "consensus" 
human sequence. (Preliminary experiments showed that grafted heavy chain genes containing the EU 
FR4 sequence expressed very poorly in transient expression systems.) 

ii. Results with grafted heavy chain genes 

Expression of grafted heavy chain genes containing all human framework regions with either gL or cL 
genes produced a grafted antibody with little ability to bind to mucin. The grafted antibody had about 
1% the activity of the chimeric antibody. In these experiments, however, it was noted that the activity 
of the grafted antibody could be increased to - 10% of B72.3 by exposure to pHs of 2-3.5. 
This observation provided a clue as to how the activity of the grafted antibody could be improved 
without acid treatment. It was postulated that acid exposure brought about the protonation of an acidic 
residue (pKa of aspartic acid = 3.86 and of glutamine acid = 4.25) which in turn caused a change in 
structure of the CDR loops, or allowed better access of antigen. 

From comparison of the sequences of B72.3 (ref. 13) and EU (refs. 4 and 5), it was clear that, in going 
from the mouse to human frameworks, only two positions had been changed in such a way that acidic 
residues had been introduced. These positions are at residues 73 and 81 , where K to E and Q to E 
changes had been made, respectively. 

Which of these positions might be important was determined by examining the crystal structure of the 
KOL antibody. In KOL heavy chain, position 81 is far removed from either of the CDR loops. 
Position 73, however, is close to both CDRs 1 and 3 of the heavy chain and, in this position it was 
possible to envisage that a K to E change in this region could have a detrimental effect on antigen 
binding. 

iii. Framework changes in B72.3 gH gene 

On the basis of the above analysis, E73 was mutated to a lysine (K). It was found that this change had 
a dramatic effect on the ability of the grafted Ab to bind to mucin. Further the ability of the grafted 
B72.3 produced by the mutated gH/gL combination to bind to mucin was similar to that of the B72.3 
chimeric antibody. 

iv. Other framework changes 

In the course of the above experiments, other changes were made in the heavy chain framework 
regions. Within the accuracy of the assays used, none of the changes, either alone or together, 
appeared beneficial. 
V. Other 

All assays used measured the ability of the grafted Ab to bind to mucin and, as a whole, indicated 
that the single framework change at position 73 is sufficient to generate an antibody with similar 
binding properties to B72.3. 

Comparison of the B72.3 murine and EU heavy chain sequences reveals that the mouse and human 
residues are identical at positions 23. 24, 71 and 78. 

Thus the mutated CDR-grafted B72.3 heavy chain corresponds to a preferred embodiment of the 
present invention. 



EXAIVIPLE 4 



50 



CDR-GRAFTING OF A MURINE ANTI-ICAM-1 MONOCLONAL ANTIBODY 



55 



A murine antibody, R6-5-D6 (EP 0314863) having specificity for Intercellular Adhesion Molecule 1 
(ICAM-1) was CDR-grafted substantially as described above in previous examples. This work is described in 
greater detail in co-pending application, British Patent Application No. 9009549.8. the disclosure of which is 
incorporated herein by reference. 

The human EU framework was used as the acceptor framework for both heavy and light chains. The CDR- 
grafted antibody currently of choice is provided by co-expression of grafted light chain gL221A and grafted 
heavy chain gH341D which has a binding affinity for ICAM 1 of about 75% of that of the corresponding 
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mouse-human chimeric antibody. 
LIGHT CHAIN 

5 gL221A has murine CDRs at positions 24-34 (CDR1). 50-56 (CDR2) and 89-97 (CDR3). In addition 

several framework residues are also the murine amino acid. These residues were chosen after consideration 
of the possible contribution of these residues to domain packing and stability of the conformation of the 
antigen binding region. The residues which have been retained as mouse are at positions 2, 3. 46 (?), 60, 
84, 85 and 87. Comparison of the murine antl-ICAM 1 and human EU light chain amino acid sequences 

10 reveals that the murine and human residues are identical at positions 46, 58 and 71 . 

HEAVY CHAIN 

gH341D has murine CDRs at positions 26-35 {CDR1), 50-56 (CDR2) and94-100B (CDR3). In addition 
15 murine residues were used in gH341D at positions 24. 48, 69, 71, 73, 80, 88 and 91. Comparison of the 
murine anti-ICAM 1 and human EU heavy chain amino acid sequences are identical at positions 23, 49 and 
78. 

EXAMPLE 5 

20 

CDR-Graftinq of murine anti'-TNFa antibodies 

25 A number of murine anti-TNFa monoclonal antibodies were CDR-grafted substantially as described 
above in previous examples. These antibodies include the murine monoclonal antibodies designated 61 
E71, hTNFI. hTNF3 and 101.4 A brief summary of the CDR-grafting of each of these antibodies is given 
below. 

30 61E71 

A similar analysis as described above (Example 1, Section 12.1.) was done for 61E71 and for the heavy 
chain 10 residues were identified at 23. 24. 48, 49, 68. 69, 71. 73. 75 and 88 as residues to potentially 
retain as murine. The human frameworks chosen for CDR-grafting of this antibody, and the hTNF3 and 

35 101.4 antibodies were RE1 for the light chain and KOL for the heavy chain. 

Three genes were built, the first of which contained 23. 24. 48, 49. 71 and 73 [gH341(6)] as murine 
residues. The second gene also had 75 and 88 as murine residues [gH341(8)] while the third gene 
additionally had 68, 69, 75 and 88 as murine residues [gH341(10)]. Each was co-expressed with gL221, the 
minimum grafted light chain (CDRs only). The gL221/gH341(6) and gL221/gH341(8) antibodies both bound 

40 as well to TNF as murine 61E71. The gL221/gH341(10) antibody did not express and this combination was 
not taken further. 

Subsequently the gL221/gH341(6) antibody was assessed in an L929 cell competition assay in which the 
antibody competes against the TNF receptor on L929 cells for binding to TNF in solution. In this assay the 
gL221/gH341(6) antibody was approximately 10% as active as murine 61E71. 

45 

hTNFI 

hTNFI is a monoclonal antibody which recognises an epitope on human TNF- . The EU human 
framework was used for CDR-grafting of both the heavy and light variable domains. 

Heavy Chain 

In the CDR-grafted heavy chain (ghTNFI) mouse CDRs were used at positions 26-35 (CDR1). 50-65 
(CDR2) and 95-102 (CDR3). Mouse residues were also used in the frameworks at positions 48. 67, 69, 71, 
55 73. 76, 89. 91. 94 and 108. Comparison of the TNF1 mouse and EU human heavy chain residues reveals 
that these are identical at positions 23. 24, 29 and 78. 
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Light Chain 

In the CDR-grafted light chain (gLhTNFI) mouse CDRs wre used at positions 24-34 (CDR1). 50-56 
(CDR2) and 89-97 (CDR3). In addition mouse residues were used in the frameworks at positions 3, 42, 48, 
5 49, 83, 106 and 108. Comparison of the hTNFI mouse and EU human light chain residues reveals that 
these are identical at positions 46, 58 and 71 . 

The grafted hTNFI heavy chain was co-expressed with the chimeric light chain and the binding ability 
of the product compared with that of the chimeric light chain/chimeric heavy chain product in a TNF binding 
assay. The grafted heavy chain product appeared to have binding ability for TNF slightly better than the 
10 fully chimeric product. 

Similarly, a grafted heavy chain/grafted light chain product was co-expressed and compared with the 
fully chimeric product and found to have closely simitar binding properties to the latter product. 

hTNF3 

IS 

hTNF3 recognises an epitope on human TNF-a. The sequence of hTNF3 shows, only 21 differences 
compared to 61E71 in the light and heavy chain variable regions, 10 in the light chain (2 in the CDRs at 
positions 50. 96 and 8 in the framework at 1, 19, 40, 45. 46, 76, 103 and 106) and 11 in the heavy chain (3 
in the CDR regions at positions 52. 60 and 95 and 8 in the framework at 1. 10, 38, 40. 67. 73, 87 and 105). 

20 The light and heavy chains of the 61E71 and hTNF3 chimeric antibodies can be exchanged without loss of 
activity in the direct binding assay. However 61E71 is an order of magnitude less able to compete with the 
TNF receptor on L929 cello for TNF-a compared to hTNF3. Based on the 61E71 CDR grafting data gL221 
o and gH341{ + 23, 24, 48, 49 71 and 73 as mouse) genes have been built for hTNF3 and tested and the 
resultant grafted antibody binds well to TNF-a. but competes very poorly in the L929 assay. It is possible 

25 that in this case also the framework residues identified for OKT3 programme may improve the competitive 
binding ability of this antibody. 

101.4 

30 101.4 is a further murine monoclonal antibody able to recognise human TNF-a. The heavy chain of this 
antibody shows good homology to KOL and so the CDR-grafting has been based on RE1 for the light chain 
and KOL for the heavy chain. Several grafted heavy chain genes have been constructed with conservative 
choices for the CDR's (gH341) and which have one or a small numt^er of non-CDR residues at positions 73. 
78 or 77-79 inclusive, as the mouse amino acids. These have been co-expressed with cL or gL221. In all 

36 cases binding to TNF equivalent to the chimeric antibody is seen and when co-expressed with cL the 
resultant antibodies are able to compete well in the L929 assay. However, with gL221 the resultant 
antibodies are at least an order of magnitude less able to compete for TNF against the TNF receptor on 
L929 cells. 

Mouse residues at other positions in the heavy chain, for example, at 23 and 24 together or at 76 have 
40 been demonstrated to provide no improvement to the competitive ability of the grafted antibody in the L929 
assay. 

A number of other antibodies including antibodies having specificity for interleukins e.g. IL1 and cancer 
markers such as carcinoembryonic antigen (CEA) e.g. the monoclonal antibody A5B7 (ref. 21), have been 
successfully CDR-grafted according to the present invention. 
45 It will be appreciated that the foregoing examples are given by way of illustration only and are not intended 
to limit the scope of the claimed invention. Changes and modifications may be made to the methods 
described whilst still falling within the spirit and scope of the invention. 
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SEQUENCE LISTING 



20 



30 



35 



(1) GENERAL INFORMATION: 



5 (i) APPLICANT: 

• (A) NAME: CELLTECH LIMITED 

(B) STREET: 216 BATH ROAD 

(C) CITY: SLOUGH 

(D) STATE: BERKSHIRE 

(E) COUNTRY: UNITED KINGDOM 

70 (F) POSTAL CODE (ZIP) : SLl 4 EN 

(G) TELEPHONE: 0753 534655 

(H) TELEFAX: 0753 536632 

(I) TELEX: 848473 

(ii) TITLE OF INVENTION: HUMANISED ANTIBODIES 
IS (iii) NUMBER OF SEQUENCES: 33 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

TCCAGATGTT AACTGCTCAC 
20 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



46 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

CAGGGGCCAG TGGATGGATA GAC 
23 

(2) INFORMATION FOR SEQ ID NO: 3: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 



55 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: lineal 

(ii) MOLECULE TYPE: protein ' 
^ (V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

10 Leu Glu lie Asn Arg Thr Val Ala Ala 

1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



50 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAATTCCCAA AGACAAAATG GATTTTCAAG TGCAGATTTT CAGCTTCCTG CTAATCAGTG 
60 

CCTCAGTCAT AATATCCAGA GGACAAATTG TTCTCACCCA GTCTCCAGCA AT.CATGTCTG 
120 

CATCTCCAGG GGAGAAGGTC ACCATGACCT GCAGTGCCAG CTCAAGTGTA AGTTACATGA 
180 

30 ACTGGTACCA GCAGAAGTCA GGCACCTCCC CCAAAAGATG GATTTATGAC ACATCCAAAC 

240 

TGGCTTCTGG AGTCCCTGCT CACTTCAGGG GCAGTGGGTC TGGGACCTCT TACTCTCTCA 
300 

35 CAATCAGCGG CATGGAGGCT GAAGATGCTG CCACTTATTA CTGCCAGCAG TGGAGTAGTA 

360 

ACCCATTCAC GTTCGGCTCG GGGACAAAGT TGGAAATAAA CCGGGCTGAT ACTGCACCAA 
/ 420 

CTGTATCCAT CTTCCCACCA TCCAGTGAGC AGTTAACATC TGGAGGTGCC TCAGTCGTGT 
40 480 

GCTTCTTGAA CAACTTCTAC CCCAAAGACA TCAATGTCAA GTGGAAGATT GATGGCAGTG 
540 

AACGACAAAA TGGCGTCCTG AACAGTTGGA CTGATCAGGA CAGCAAAGAC AGCACCTACA 
45 600 

GCATGAGCAG CACCCTCACG TTGACCAAGG ACGAGTATGA ACGACATAAC AGCTATACCT 
660 



GTGAGGCCAC TCACAAGACA TCAACTTCAC CCATTGTCAA GAGCTTCAAC AGGAATGAGT 
720 

GTTAGAGACA AAGGTCCTGA GACGCCACCA CCAGCTCCCA GCTCCATCCT ATCTTCCCTT 
780 
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CTAAGGTCTT GGAGGCTTCC CCACAAGCGC TTACCACTGT TGCGGTGCTC TAAACCTCCT 840 

CCCACCTCCT TCTCCTCCTC CTCCCTTTCC TTGGCTTTTA TC-ATGCTAAT liTTTGQXQAA 900 

AATATTCAAT AAAGTGAGTC TTTGCCTTGA AAAAAAAAAA AAA 94 3 

5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

70 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asp Phe Val lie Phe Ser Phe Leu Leu lie Ser Ala Ser Val lie 
15 10 15 

lie Ser Arg Gly Gin He Val. Leu Thr Gin Ser Pro Ala He Met Ser 
20 2 0 2 5 3 0 

Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser 
35 40 45 



Val Ser Tyr Met Asn Trp Tyr Gin Gin Lys Ser Gly Thr Ser Pro Lys 
50 55 60 

Arg Trp He Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala His 
65 70 75 80 

Phe Arg Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He Ser Gly 

85 90 95 

Met Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp Ser Ser 

100 105 110 

Asn Pro Phe Thr Phe Gly Ser Gly Thr Lys Leu Glu He Asn Arg Ala 
115 120 125 

Asp Thr Ala Pro Thr Val Ser He Phe Pro Pro Ser Ser Glu Gin Leu 
130 135 140 

Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe Tyr Pro 
145 150 155 160 

40 Lys Asp He Asn Val Lys Trp Lys He Asp Gly Ser Glu Arg Gin Asn 

165 170 175 

Gly Val Leu Asn Ser Trp Thr Asp Gin Asp Ser Lys Asp Ser Thr Tyr 

180 185 190 



Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu Arg His 
195 200 205 

Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser Pro He 
210 215 220 

Val Lys Ser Phe Asn Arg Asn Glu Cys 
50 225 230 

(2) INFORMATION FOR SEQ ID NO: 6: 
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15 



20 



25 



30 
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50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GAATTCCCCT 


CTCCACAGAC 


ACTGAAAACT 


CTGACTCAAC 


ATGGAAAGGC 


ACTGGATCTT 


60 


TCT ACTC CTG 


TTGTCAGTAA 


CTGCAGGTGT 


CCACTCCCAG 


GTCCAGCTGC 


AGCAGTCTGG 


120 


GGCTGAACTG 


GCAAGACCTG 


GGGCCTCAGT 


GAAGATGTCC 


TGCAAGGCTT 


CTGGCTACAC 


180 


CTTTACTAGG 


TACACGATGC 


ACTGGGTAAA 


ACAGAGGCCT 


GGACAGGGTC 


TGGAATGGAT 


240 


TGGATACATT 


AATCCTAGCC 


GTGGTTATAC 


TAATTACAAT 


CAGAAGTTCA 


AGGACAAGGC 


300 


CACATTGACT 


ACAGACAAAT 


CCTCCAGCAC 


AGCCTACATG 


CAACTGAGCA 


GCCTGACATC 


3 60 


TGAGGACTCT 


GCAGTCTATT 


ACTGTGCAAG 


ATATTATGAT 


GATCATTACT 


GCCTTGACTA 


420 


CTGGGGCCAA 


GGCACCACTC 


TCACAGTCTC 


CTCAGCCAAA 


ACAACAGCCC 


CATCGGTCTA 


480 


TCCACTGGCC 


CCTGTGTGTG 


GAGATACAAC 


TGGCTCCTCG 


GTGACTCTAG 


GATGCCTGGT 


540 


CAAGGGTTAT 


TTCCCTGAGC 


CAGTGACCTT 


GACCTGGAAC 


TCTGGATCCC 


TGTCCAGTGG 


600 


TGTGCACACC 


TTCCCAGCTG 


TCCTGCAGTC 


TGACCTCTAC 


ACCCTCAGCA 


GCTCAGTGAC 


660 


TGTAACCTCG 


AGCACCTGGC 


CCAGCCAGTC 


CATCACCTGC 


AATGTGGCCC 


ACCCGGCAAG 


720 


CAGCACCAAG 


GTGGACAAGA 


AAATTGAGCC 


CAGAGGGCCC 


ACAATCAAGC 


CCTGTCCTCC 


780 


ATGCAAATGC 


CCAGCACCTA 


ACCTCTTGGG 


TGGACCATCC 


GTCTTCATCT 


TCCCTCCAAA 


840 


GATCAAGGAT 


GTACTCATGA 


TCTCCCTGAG 




AV^nXu Xu Xuu 


X \a\j X uuA X u X 


Qnn 

9 WW 


GAGCGAGGAT 


GACCCAGATG 


TCCAGATCAG 


CTGGTTTGTG 


AACAACGTGG 


AAGTACACAC 


960 


AGCTCAGACA 


CAAACCCATA 


GAGAGGATTA 


CAACAGTACT 


CTCCGGGTGG 


TCAGTGCCCT 


1020 


CCCCATCCAG 


CACCAGGACT 


GGATGAGTCC 


CAAGGAGTTC 


AAATGCAAGG 


TCAACAACAA 


1080 


AGACCTCCCA 


GCGCCCATCG 


AGAGAACCAT 


CTCAAAACCC 


AAAGGGTCAG 


TAAGAGCTCC 


1140 


ACAGGTATAT 


GTCTTGCCTC 


CACCAGAAGA 


AGAGATGACT 


AAGAAACAGG 


TCACTCTGAC 


1200 


CTGCATGGTC 


ACAGACTTCA 


TGCCTGAAGA 


CATTTACGTG 


GAGTGGACCA 


ACAACGGGAA 


1260 


AACAGAGCTA 


AACTACAAGA 


ACACTGAACC 


AGTCCTGGAC 


TCTGATGGTT 


CTTACTTCAT 


1320 


GTACAGCAAG 


CTGAGAGTGG 


AAAAGAAGAA 


CTGGGTGGAA 


AGAAATAGCT 


ACTCCTGTTC 


1380 


AGTGGTCCAC 


GAGGGTCTGC 


ACAATCACCA 


CACGACTAAG 


AGCTTCTCCC 


GGACTCCGGG 


1440 


TAAATGAGCT 


CAGCACCCAC 


AAAACTCTCA 


GGTCCAAAGA 


GACACCCACA 


CTCATCTCCA 


1500 


TGCTTCCCTT 


GTATAAATAA 


AGCACCCAGC 


AATGCCTGGG 


ACCATGTAAA 


AAAAAAAAAA 


1560 


AAAGGAATTC 












1570 


(2) INFORMATION FOR SEQ ID NO: 7: 











55 



31 



EP000620276 [ fi{e:/A\clcwas03\firmdata\ip\FolevPat\PatentDQCuments\EP000620276.CPCl 



Page 32 of 66 



EP 0 620 276 A1 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(i±) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Glu Arg His Trp He Phe Leu Leu Leu Leu Ser Val Thr Ala Gly 
15 10 15 

Val His Ser Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Ala Arg 

20 25 30 

/5 Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe 

35 40 45 

Thr Arg Tyr Thr Met His Trp Val Lys Gin Arg Pro Gly Gin Gly Leu 
50 55 60 



10 



20 



Glu Trp He Gly Tyr He Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn 
65 70 75 80 

Gin Lys Phe Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser 

85 90 95 

Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 
25 100 105 110 

Tyr Tyr Cys Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp 
115 120 125 



30 



35 



Gly Gin Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Ala Pro 
130 135 140 

Ser Val Tyr Pro Leu Ala Pro Val Cys Gly Asp Thr Thr Gly Ser ser 
145 150 155 160 

Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val Thr 

165 170 175 

Leu Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe Pro 

180 185 190 

Ala Val Leu Gin Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr Val 
195 200 205 

40 Thr Ser Ser Thr Trp Pro Ser Gin Ser He Thr Cys Asn Val Ala His 

210 215 220 

Pro Ala Ser Ser Thr Lys Val Asp Lys Lys He Glu Pro Arg Gly Pro 
225 230 235 240 



45 



Thr He Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu Leu 

245 250 255 

Gly Gly Pro Ser Val Phe He Phe Pro Pro Lys He Lys Asp val Leu 

260 265 270 

Met He Ser Leu Ser Pro He Val Thr Cys Val Val Val Asp Val Ser 

50 275 280 285 

Glu Asp Asp Pro Asp Val Gin He Ser Trp Phe Val Asn Asn Val Glu 
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290 295 300 

• • ^ 

Val His Thr Ala Gin Thr Gin Thr His Arg Glu Asp-O^yr -Asn- Gtr Thr 
305 310 315 320 

5 Leu Arg Val Val Ser Ala Leu Pro lie Gin His Gin Asp Trp Met Ser 

325 330 335 

Gly Lys Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro 

340 345 350 

70 lie Glu Arg Thr lie Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gin 

355 360 365 

Val Tyr Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gin Val 
370 375 380 

Thr Leu Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp lie Tyr Val 
'5 385 390 395 400 

Glu Trp Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu 

405 410 415 

Pro Val Leu Asp Ser . Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg 
20 420 425 430 

Val Glu Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val 
435 440 445 



25 



Val His Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg 
450 455 460 

Thr Pro Gly Lys 
465 



(2) INFORMATION FOR SEQ ID NO: 8: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gin lie Val Leu Thr Gin Ser Pro Ala lie Met Ser Ala Ser Pro Gly 
40 1 5 10 15 

Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 



45 



50 



Asn Trp Tyr Gin Gin Lys Ser Gly Thr Ser Pro Lys Arg Trp lie Tyr 
35 40 45 

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Ser Leu Thr lie Ser Gly Met Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr 

85 90 95 
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Phe Gly Ser Gly Thr Lys Leu Glu He Asn Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO: 9: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 9: 



15 



Asp He Gin Met: Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr He Thr Cys Gin Ala Ser Gin Asp He He Lys Tyr 

20 25 30 

Leu Asn Trp Tyr Gin Gin Thr Pro Gly Lys Ala Pro Lys Leu Leu He 
20 35 40 45 

Tyr Glu Ala Ser Asn Leu Gin Ala Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 



25 



30 



35 



Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr He Ser Ser Leu Gin Pro 
65 70 75 80 

Glu Asp He Ala Thr Tyr Tyr Cys Gin Gin Tyr Gin Ser Leu Pro Tyr 

85 90 95 

Thr Phe Gly Gin Gly Thr Lys Leu Gin He Thr Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala 
15 10 15 

Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp He 
35 40 45 

Gly Tyr He Asn Pro Ser Arg Gly Tyr Thr Asn Thr Asn Gin Lys Phe 
50 55 60 

Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 



45 



50 



55 
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Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 

85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys; Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

5 

Thr Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 11: 

70 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



75 



25 



30 



3S 



45 



50 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gin Val Gin Leu Val Glu Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
20 1 5 10 15 

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe lie Phe Ser Ser Tyr 

20 25 30 



Ala Met Tyr Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Ala lie lie Trp Asp Asp Gly Ser Asp Gin His Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe 
65 70 75 80 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 

Ala Arg Asp Gly Gly His Gly Phe Cys Ser Ser Ala Ser Cys Phe Gly 

100 105 110 

Pro Asp Tyr Trp Gly Gin Gly Thr Pro Val Thr Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala 
15 10 15 

Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 



55 



35 



EP000620276 ffile:/A\dcwas03\firmdata\lp\FolevPat\PatentDocuments\EP000620276.CPC l 



Page 36 of 66 



EP 0 620 276 A1 



10 



Thr Met His Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp lie 
35 40 45 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr' Thr As n Tyr Asn Gin Lys Phe 
50 55 60 

Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 

Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 

85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

Thr Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gin Val Gin Leu Val Glu Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 



76 



20 



30 



35 



40 



45 



Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Ala Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Phe 
50 55 60 

Lys Asp Arg Phe Thr lie Ser Arg Asp Asn ser Lys Asn Thr Leu Phe 
65 70 75 80 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

Thr Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gin Val Gin Leu Val Gin Ser Gly Gly- Gly MaX, Val .Gin Prxi;(;iy Arg 
1 5 10 - 15 

5 Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 

7Q Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Val 

50 55 60 

Lys Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe 
65 70 75 80 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys 
15 85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 



20 



40 



46 



50 



Thr Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 
35 20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 



Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Val 
50 55 60 

Lys Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe 
65 70 75 80 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

Thr Thr Leu Thr Val Ser Ser ' 
115 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

10 Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 

15 10 15 

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 



75 



Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Val 
50 55 60 

Lys Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Asn Thr Ala Phe 
20 65 70 75 80 

Leu Gin Met: Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 



25 



Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

Thr Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 17: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



45 



50 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
40 1 5 10 15 

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 



Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Val 
50 55 60 

Lys Asp Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Ala Phe 
65 70 75 80 

Leu Gin Met Asp ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 



55 
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Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 ll'O 

Thr Thr Leu Thr Val Ser Ser 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

'® (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 
20 20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 



25 



30 



35 



Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Val 
50 55 60 

Lys Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Asn Thr Leu Phe 

65 70 75 80 

■ - .- • 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

Thr Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

50 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 
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10 



15 



20 



25 



30 



35 



50 



Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Val 
50 55 . 60 

Lys Asp Arg Phe Thr lie Ser Arg Asp. Asn ^er Lys Asn Ttir Leu Phe 
65 70 75 80 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

Thr Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Ala Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lys Phe 
50 55 60 

Lys Asp Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe 
65 70 75 80 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly 

100 105 110 

40 Thr Thr Leu Thr Val Ser Ser 

115 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



55 
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Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
1 5 10 .15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Atg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys 
50 55 60 

Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu 
65 70 75 80 

Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys Ala 

85 90 95 

^5 Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly Thr 

100 105 110 

Thr Leu Thr Val Ser Ser 
115 



10 



20 



25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Gin Val Gin Leu Val Glu Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

36 Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 

35 40 45 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys 
50 55 60 



Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu 
65 70 75 80 

Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys Ala 

85 90 95 

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly Thr 
45 100 105 110 

Thr Leu Thr Val ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 23; 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 



55 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Girl Val Gin Leu Val Glu Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 

'5 Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys 

50 55 60 

Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu 
65 70 75 80 



10 



20 



Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys Ala 

85 90 95 

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly Thr 

100 105 110 

Thr Leu Thr Val Ser Ser 
25 115 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



30 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
35 40 45 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys 
50 55 60 

Asp Arg Phe Thr He Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu 
65 70 75 80 

Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys Ala 

85 90 95 

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly Thr 

100 105 110 



55 
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Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 25: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

_ Gin Val Gin Leu Val Glu Ser Gly Gly Gly Val Val Gin Pro Gly Arg 

1 5 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 
20 35 40 4 5 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys 

50 55 60 



25 



30 



35 



40 



45 



50 



Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu 
65 70 75 80 

Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys Ala 

85 90 95 

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly Thr 

100 105 110 

Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gin Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr 

20 25 30 

Thr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie 

35 40 45 

Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys 
50 55 60 
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10 



Asp Arg Phe Thr lie Ser Thr Asp Lys Ser Lys Asn Thr Ala Phe Leu 
65 70 ISf •■ 80 

Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val'T/r Phe Cys Ala 

85 90 95 

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gin Gly Thr 

100 105 110 

Thr Leu Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Gin Val Gin Leu Val Glu Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
15 10 15 



75 



25 



30 



Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe lie Phe Ser Ser Tyr 

20 25 30 

Ala Met Tyr Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Ala lie lie Trp Asp Asp Gly Ser Asp Gin His Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe 
65 70 75 80 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

85 90 95 

Ala Arg Asp Gly Gly His Gly Phe Cys Ser Ser Ala Ser Cys Phe Gly 

100 105 110 

Pro Asp Tyr Trp Gly Gin Gly Thr Pro Val Thr Val Ser Ser 
115 120 125 

^ (2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



35 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Gin lie Val Leu Thr Gin Ser Pro Ala lie Met Ser Ala Ser Pro Gly 
15 10 15 
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Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

2 0 25 \ 30' 

Asn Trp Tyr Gin Gin Lys Ser Gly Thr Ser Pro Lys A^g Trp He Tyr 
35 40 45 

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Ser Leu Thr He Ser Gly Met Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr 

85 90 95 

Phe Gly Ser Gly Thr Lys Leu Glu lie Asn Arg 

100 105 

^5 (2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 



26 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr He Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Asn Trp Tyr Gin Gin Thr Pro Gly Lys Ala Pro Lys Leu Leu He Tyr 
35 40 45 

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Tyr Thr Phe Thr He Ser Ser Leu Gin Pro Glu 
65 70 75 80 

Asp He Ala Thr Tyr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr 

85 90 95 

40 Phe Gly Gin Gly Thr Lys Leu Gin He Thr Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO: 30: 



35 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
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10 



IS 



20 



25 



30 



35 



40 



45 



Gin lie Val Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
• 1 ' 5 10 15 

Asp Arg Val Thr lie Thr Cys Ser ^la Ser Ser Ser Val S©r Tyr Met 

20 25 30 

Asn Trp Tyr Gin Gin Thr Pro Gly Lys Ala Pro Lys Arg Trp lie Tyr 
35 40 45 

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Tyr Thr Phe Thr lie Ser Ser Leu Gin Pro Glu 
65 70 75 80 

Asp lie Ala Thr Tyr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr 

85 90 95 

Phe Gly Gin Gly Thr Lys Leu Gin lie Thr Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gin lie Val Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr lie Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Asn Trp Tyr Gin Gin Thr Pro Gly Lys Ala Pro Lys Arg Trp lie Tyr 
35 40 45 

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Tyr Thr Phe Thr lie Ser Ser Leu Gin Pro Glu 
65 70 75 80 

Asp lie Ala Thr Tyr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr 

85 90 95 

Phe Gly Gin Gly Thr Lys Leu Gin lie Thr Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Asp lie Gin Met Thr Gin Ser Pro Seir Ser r*eu S-ar ZAla S'eri s/f l Gly 
1 5 lu. 1^ 

Asp Arg Val Thr lie Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Asn Trp Tyr Gin Gin Thr Pro Gly Lys Ala Pro Lys Arg Trp lie Tyr 
35 40 45 

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Tyr Thr Phe Thr He Ser Ser Leu Gin Pro Glu 
65 70 75 80 

Asp He Ala Thr Tyr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr 

85 90 95 

Phe Gly Gin Gly Thr Lys Leu Gin He Thr Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 
26 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr He Thr Cys Gin Ala Ser Gin Asp He He Lys Tyr 

20 25 30 

Leu Asn Trp Tyr Gin Gin Thr Pro Gly Lys Ala Pro Lys Leu Leu He 
40 3 5 40 4 5 

Tyr Glu Ala Ser Asn Leu Gin Ala Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 



Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr He Ser Ser Leu Gin Pro 

65 70 75 80 

Glu Asp He Ala Thr Tyr Tyr Cys Gin Gin Tyr Gin Ser Leu Pro Tyr 

85 90 95 

Thr Phe Gly Gin Gly Thr Lys Leu Gin He Thr Arg 

100 105 



Claims 

1. A CDR-grafted antibody heavy chain having a variable region donriain comprising acceptor framework 
and donor antigen binding regions wherein the framework comprises donor residues at at least one of 
positions 6, 23 and/or 24, 48 and/or 49. 71 and/or 73, 75 and/or 76 and/or 78 and 88 and/or 91 . 
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2. A CDR-grafted heavy chain according to Claim 1 connprising donor residues at positions 23. 24. 49. 71 . 
73 and 78, or at positions 23. 24 and 49. 

3. A CDR-grafted heavy chain according to Claim 2 comprising donor residues at positions 2. 4, 6, 25, 36. 
5 37, 39, 47, 48. 93, 94, 103, 104. 106 and 107. 

4. A CDR-grafted heavy chain according to Claim 2 or 3, comprising donor residues at one. some or all of 

positions: 
1 and 3, 

10 69 (if 48 is different between donor and acceptor), 
38 and 46 (if 48 is the donor residue). 
67. 

82 and 18 (if 67 Is the donor residue). 
91, and 

15 any one or more of 9, 11, 41, 87, 108, 110 and 112. 

5. A CDR-grafted heavy chain according to any of the preceding comprising donor CDRs at positions 26- 
35, 50-65 and 95-100. 

20 6. A CDR-grafted antibody light chain having a variable region domain comprising acceptor framework 
and donor antigen binding regions wherein the framework comprises donor residues at at least one of 
positions 1 and/or 3 and 46 and/or 47. 



25 



7. A CDR-grafted light chain according to Claim 6 comprising donor residues at positions 46 and 47. 

a A CDR-grafted antibody light chain having a variable region domain comprising acceptor framework 
and donor antigen binding regions wherein the framework comprises donor residues at at least one of 
positions 46, 48, 58 and 71. 

30 9. A CDR-grafted light chain according to Claim 8 comprising donor residues at positions 46. 48, 58 and 
71. 

10. A CDR-grafted light chain according to Claim 8 or 9. comprising donor residues at positions 2, 4, 6, 35. 
36. 38. 44. 47. 49. 62, 64-69. 85. 87, 98. 99, 101 and 102. 

35 

11. A CDR-grafted light chain according to Claim 9 or 10, comprising donor residues at one, some or all of 
positions: 

1 and 3. 
63. 

40 60 (if 60 and 54 are able to form a potential saltbridge), 
70 (if 70 and 24 are able to form a potential saltbridge). 
73 and 21 (if 47 is different between donor and acceptor), 
37 and 45 (if 47 if different between donor and acceptor), and 
any one or more of 10. 12. 40. 83. 103 and 105. 

45 

12. A CDR-grafted light chain according to any one of Claims 6-11, comprising donor CDRs at positions 
24-34. 50-56 and 89-97. 

13. A CDR-grafted antibody molecule comprising at least one CDR-grafted heavy chain according to any 
50 one of Claims 1-5 and at least one CDR-grafted light chain according to any one of Claims 6-12. 

14. A CDR-grafted antibody molecule according to Claim 13, which is a site-specific antibody molecule. 

15. A CDR-grafted antibody molecule according to Claim 13 which has specificity for an Interleukin, 
55 hormone or other biologically active compound or a receptor therefor. 

16. A CDR-grafted antibody heavy or light chain or molecule according to any one of the preceding claims 
comprising human acceptor residues and non-human donor residues. 
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5 



10 



20 



25 



17. A DNA sequence which codes for a CDR-grafted heavy chain according to Claim 1 or a CDR-grafted 
light chain according to Claim 6 or Claim 8. 

18. A cloning or expression vector containing a DNA sequence according to Claim 17. 

19. A host cell transformed with a DNA sequence according to Claim 17. 

20. A process for the production of a CDR-grafted antibody sequence according to Claim 17 in a 
transformed host cell. 



21. A process for producing a CDR-grafted antibody product comprising: 

(a) producing in an expression vector an operon having a DNA sequence which encodes an antibody 
heavy chain according to Claim 1; 
and/or 

76 (b) producing In an expression vector an operon having a DNA sequence which encodes a 

complementary antibody light chain according to Claim 6 or Claim 8; 

(c) transfecting a host cell with the or each vector; 
and 

(d) culturing the transfected cell line to produce the CDR-grafted antibody product. 



22. A therapeutic or diagnostic composition comprising a CDR-grafted antibody heavy chain according to 
Claim 1. or a CDR-grafted light chain according to Claim 6 or Claim 8. or a CDR-grafted antibody 
molecule according to Claim 13 in combination with a pharmaceutically acceptable carrier, diluent or 
excipient. 

23. A method of therapy or diagnosis comprising administering an effective amount of a CDR-grafted heavy 
chain according to Claim 1 , or a CDR-grafted light chain according to Claim 6 or Claim 8, or a CDR- 
grafted antibody molecule according to Claim 13 to a human or animal subject. 



30 



35 



40 



45 



50 
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1 


GAATTCCCAA AGACAAAata aattttcaaa tacaaatttit: ci^ad^^t.c<^i:.a 


51 


ctiaa^caota 


cctcaotcat aatatccaaa aoacaaatug ti^ef fra^^erra 


101 


gtctccagca 


atcatgtctg catctccagg ggagaagg^c accatgacct 


151 


gcagtgccag 


ctcaag'tg^a agttacatga ac^ggtacca gcagaag'tca 


201 


ggcacctccc 


ccaaaagatg gatttatgac acatccaaac ^ggcttctgg 


251 


agtccctgct 


cacttcaggg gcag^ggg^c tgggacctct ^ac^ctctca 


301 


caatcagcgg 


catggaggct gaagatgctg ccacttatta ctgccagcag 


351 


tggagtagta 


acccat^cac gt^cggctcg gggacaaag^ ^ggaaataaa 


401 


ccgggctgat: 


actgcaccaa ctg^atccat cttcccacca tccagtgagc 


451 


agttaacatc 


tggaggt-gcc tcagtcg^gt gcttcttgaa caacttctac 


501 


cccaaagaca 


tcaatgtcaa gtggaagatt gatggcagtg aacgacaaaa 


551 


tggcgtcctg 


aacagt1:gga ctgatcagga cagcaaagac agcacctaca 


601 


gcatgagcag 


caccctcacg 'ttgaccaagg acgag'ta^ga acgaca1:aac 


651 


agctatacct 


gtgaggccac ^cacaagaca tcaacttcac ccattgtcaa 


701 


gagcttcaac 


aggaatgagt gtTAGAGACA AAGGTCCTGA GACGCCACCA 


751 


CCAGCTCCCA 


GCTCCATCCT . ATCTTCCCTT CTAAGGTCTT GGAGGCTTCC 


801 


CCACAAGCGC 


tTACCACTGT TGCGGTGCTC tAAACCTCCT CCCACCTCCT 


851 


TCTCCTCCTC 


CTCCCTTTCC TTGGCTTTTA TCATGCTAAT ATTTGCAGAA 


901 


AATATTCAAT 


AAAGTGAGTC TTTGCCTTGA AAAAAAAAAA AAA 

Fig.Ua) 



1 KDFOVOTFSF LLISASVIIS RGOIVI.TQSP AIMSASPGEK VTMTCSASSS 

51 VSYMNWYQQK SGTSPKRWIY DTSKIASGVP AHFRGSGSGT SYSLTISGME 

101 AEDAATYYCQ QWSSNPFTFG SGTKLEINRA DTAPTVSIFP PSSEQLTSGG 

151 ASWCFLNNF YPKDINVKWK IDGSERQNGV LNSWTDQDSK DSTYSMSSTL 

2 01 TLTKDEYERH NSYTCEATHK TSTSPIVKSF NRNEC* 

Fig. Kb) 
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1 GAATTCCCCT CTCCACAGAC ACTGAAAACT CTGACTCAAC ATGGAAAGGC 
51 ACTGGATCTT TCTACT CCTG TTGTCAGTAA CTGCAGGTGT CCACTCC CAG 
101 GTCCAGCTGC AGCAGTCTGG GGCTGAACTG GCAAGACCTG GGGCCTCAGT 
151 GAAGATGTCC TGCAAGGCTT CTGGCTACAC CTTTACTAGG TACACGATGC 
201 ACTGGGTAAA ACAGAGGCCT GGACAGGGTC TGGAATGGAT TGGATACATT 
251 AATCCTAGCC GTGGTTATAC TAATTACAAT CAGAAGTTCA AGGACAAGGC 
301 CACATTGACT ACAGACAAAT CCTCCAGCAC AGCCTACATG CAACTGAGCA 
351 GCCTGACATC TGAGGACTCT GCAGTCTATT ACTGTGCAAG ATATTATGAT 
401 GATCATTACT GCCTTGACTA CTGGGGCCAA GGCACCACTC TCACAGTCTC 
451 CTCAGCCAAA ACAACAGCCC CATCGGTCTA TCCACTGGCC CCTGTGTGTG 
501 GAGATACAAC TGGCTCCTCG GTGACTCTAG GATGCCTGGT CAAGGGTTAT 
551 TTCCCTGAGC CAGTGACCTT GACCTGGAAC TCTGGATCCC TGTCCAGTGG 
601 TGTGCACACC TTCCCAGCTG TCCTGCAGTC TGACCTCTAC ACCCTCAGCA 
651 GCTCAGTGAC TGTAACCTCG AGCACCTGGC CCAGCCAGTC CATCACCTGC 
701 AATGTGGCCC ACCCGGCAAG CAGCACCAAG GTGGACAAGA AAATTGAGCC 
751 CAGAGGGCCC ACAATCAAGC CCTGTCCTCC ATGCAAATGC CCAGCACCTA 
8 01 ACCTCTTGGG TGGACCATCC GTCTTCATCT TCCCTCCAAA GATCAAGGAT 
851 GTACTCATGA TCTCCCTGAG CCCCATAGTC ACATGTGTGG TGGTGGATGT 
901 GAGCGAGGAT GACCCAGATG TCCAGATCAG CTGGTTTGTG AACAACGTGG 
951 AAGTACACAC AGCTCAGACA CAAACCCATA GAGAGGATTA CAACAGTACT 
1001 CTCCGGGTGG TCAGTGCCCT CCCCATCCAG CACCAGGACT GGATGAGTGG 
1051 CAAGGAGTTC AAATGCAAGG TCAACAACAA AGACCTCCCA GCGCCCATCG 
1101 AGAGAACCAT CTCAAAACCC AAAGGGTCAG TAAGAGCTCC ACAGGTATAT 
1151 GTCTTGCCTC CACCAGAAGA AGAGATGACT AAGAAACAGG TCACTCTGAC 
1201 CTGCATGGTC ACAGACTTCA TGCCTGAAGA CATTTACGTG GAGTGGACCA 
1251 ACAACGGGAA AACAGAGCTA AACTACAAGA ACACTGAACC AGTCCTGGAC 
1301 TCTGATGGTT CTTACTTCAT GTACAGCAAG CTGAGAGTGG AAAAGAAGAA 
1351 CTGGGTGGAA AGAAATAGCT ACTCCTGTTC AGTGGTCCAC GAGGGTCTGC 
1401 ACAATCACCA CACGACTAAG AGCTTCTCCC GGACTCCGGG TAAATGAGCT 
1451 CAGCACCCAC AAAACTCTCA GGTCCAAAGA GACACCCACA CTCATCTCCA 
1501 TGCTTCCCTT GTATAAATAA AGCACCCAGC AATGCCTGGG ACCATGTAAA 
1551 AAAAAAAAAA AAAGGAATTC 

Fig. 2(a) 
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OKT 3 HEAVY CHAIN PROTEIN SEQUENCE DEDUCED FROM DNA SEQUENCE 

1 MERHV?IFLLL LSVTAGVHS Q VQLQQSGAEL ARPGASVKMS CKASGYTFTR 

51 YTMHWVKQRP GQGLEWIGYI NPSRGYTNYN QKFKDKATLT TDKSSSTAYM 

101 QLSSLTSEDS AVYYCARYYD DHYCLDYWGQ GTTLTVSSAK TTAPSVYPLA 

151 PVCGDTTGSS VTLGCLVKGY FPEPVTLTWN SGSLSSGVHT FPAVLQSDLY 

201 TLSSSVTVTS STWPSQSITC NVAHPASSTK VDKKIEPRGP TIKPCPPCKC 

251 PAPNIXGGPS VFIFPPKIKD VLMISLSPIV TCWVDVSED DPDVQISWFV 

301 NNVEVHTAQT QTHREDYNST LRWSALPIQ KQDWMSGKEF KCKVNNKDLP 

351 APIERTISKP KGSVRAPQVY VLPPPEEEMT KKQVTLTCMV TDFMPEDIYV 

4 03 EWTNNGKTEL NYKNTEPVLD SDGSYFMYSK LRVEKKNWVZ RNSYSCSWH 

4 51 EGLHNHHTTK SFSRTPGK* 



Fig. 2(b) 



1 . 23 42 

NN N N N N 

RES TYPE SBspSPESssBSbSsSssPSPSPsPSsse*s*p*Pl^ISsSe 
0}Ct3vl QIVLTQSPAIMSASPGEKVTMTCSASS. SVSYMJJWYQQKSGT 

REI DIQMTQSPSSLSASVGDKVTITCQASQDIIKYLNWYQQTPGK 

CDRl (LOOP) ******* 
CDRl (KABAT) *********** 



56 85 

N NN 

RES TYPE ^IsiPpIeesesssSBEsePsPSBSSEsPspsPsseesSPePb 
0)ct3vl SPKRWIYDTSKLASGVPASFRGSGSGTSYSLTISGMEAEDAAT 
REI APKLLIYEASNLQAGVPSRFSGSGSGTDYTZTISSLQPEDIAT 

******* CDR2 (LOOP/KABAT) 



102 108 

RES TYPE PiPIPies**iPIlsPPS?SPSS 
Okt3vl YYCQQWSSNPFTFGSGTKLEINR 
REIvl YYCQQYQSLPYTFGQGTKLQIIR 

• • 

****** CDR3 (LOOP) 

********* CRD3 (KABAT) 



Fig. 3 
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KK K 23 26 32 35 M39 43 

RES TYPE SESPs'SBssS^sSSsSpSpSPsPSEbSBssBePiPIpiesss 

0)Ct:3h QVQLQQSGAEIABPGASVKHSCKASGYTFTRYTMHWVKQRPGQ 

KOL OVQLVESGGGWOPGR SLRLSCS S SGFI FSS YAMYWVRQAPGK 

•> -JO 

****** CDRl (IX)OP) 

***** CDRl (KABAT) 



52a 60 65 N N H 82abc 89 

RES TYPE Ilelppp'ssssssss^ps "pSSsbSpseSsSseSp"pSpsSBssS*ePb 
Okt3vh GLEWIGYINPSRGYTNTNQKFKBKATLTTDKSSSTAYMQLSSLTSEDSAV 
KOL GLEWVftl I WDDGSDQH YADSVKGRFTI SRDHSKNTLZLQMDSLR£EDTGV 

• • • • • • s 

************ CDR2 (LOOP) 

******************* CDR2 (KABAT) 

92 N 107 113 

RES TYPE PiPI£is5Ssiiisssbibi*EIPIP*spSBSS 

Okt 3 vh YYCARYYDDHY CLDYWGQGTTLTVSS 

KOL Y£CARDGGHGFCSSASCFGPDYWGQGTEVTVSS 

***************** CRD3 (KABAT/LOOP) 

Fig. A 
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OKT 3 HEAVY CHAIK CDR GRAFTS 
1. gh3 41 and derivatives 

2 26 35 39 43 

Okt3vh QVQLQQSGAELARPGASVKMSCKASGYTFTRYTMHWVKQRPGQ 

gH3 41 QVQLVESGGGWQPGRSlJU.SCSSSSXIElE3mfiIMVRQAPGK JA178 

gH341A QVQLVfiSGGGWQPGRSLRI-SCKASfiriSUmaiHWVRQAPGK JA185 

gH3 41E QVQLVfiSGGGWQPGRSLRLSqCi^SSXlElEXlllHWVRQAPGK JA198 

gH341* QVQLV2SGGGWQPGRSLRI.SqCASGXlZIEX324HWVRQAPGK JA207 

gH341* QVQLVfiSGGGWQPGRSIJU-Sq^^SSmiErDJHWVRQAPGK JA209 

gH3 41D QVQLVgSGGGWQPGRSLRLSqC^SGXXZaCEXIMHWVRQAPGK JA197 

gH3 41* QVQLV2SGGGWQPGRSIJU>SCK^§GX1EIEX321HWVRQAPGK JA199 

gH341C QVQLV2SGGGWQPGRSLRl.SCJO^SSm2B3a25HWVRQAPGK JA184 

gH3 4 1 * QVQLVgSGGGWQPGRSlJU^SCS^SfiVTFffiXlMHWVRQAPGK JA2 03 

gH341* QVQLVESGGGWQPGRSLRLSCS^^SfiXlOBXHSHWVRQAPGK JA205 

gH341B QVQLVESGGGWQPGRSlJU-SCSSSSm3EX331HWVRQAPGK JA183 

gH341* QVQLVfiSGGGWQPGRSlJU^SCS^SGYTIIEXlIJHWVRQAPGK JA204 

gH341* QVQLVESGGGWQPGRSUlLSCSASSmiEXlllHWVRQAPGK JA206 

gH341* QVQLV2SGGGWQPGRSIJU.SCS^S£YlZIEX32iHWVRQAPGK JA208 
KOL QVQLVESGGGWQPGRSLRLSCSSSGFIFSSYAMYWVRQAPGK 

Fig. 5(i) 
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Fig. 5(ii) 
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